v0.0.27: Qwen2 models, Neuron SDK 2.20.2

@dacorvo released this 13 Dec 10:00

What's Changed

  • Add support for Qwen2 models (#746)
  • Bump Neuron SDK to 2.20.2 (#743)
  • NeuronX TGI: Bump router version to 3.0.0 (#748)

Bug fixes

  • training: Fix consolidation issue when tensor parallelism (TP) is enabled (#739)
  • inference: Fix T5 decoder compilation error occurring since Neuron SDK 2.20 (#732)

Full Changelog: v0.0.26...v0.0.27