What's Changed
- Add support for Qwen2 models (#746)
- Bump Neuron SDK to 2.20.2 (#743)
- NeuronX TGI: bump router version to 3.0.0 (#748)
Bug fixes
- training: Fix consolidation issue when tensor parallelism (TP) is enabled (#739)
- inference: Fix T5 decoder compilation error present since Neuron SDK 2.20 (#732)
Full Changelog: v0.0.26...v0.0.27