v3.2.1
📦 tgiView on GitHub →
✨ 2 features🐛 2 fixes🔧 2 symbols
Summary
This release introduces support for the Gemma 3 text model type and the official release of the Gaudi Backend. It also includes necessary updates for Triton kernel compilation and various bug fixes.
Migration Steps
- If using Triton kernels, ensure `gcc` is available during runtime for kernel compilation.
✨ New Features
- Router now supports the `gemma3-text` model type.
- Release of Gaudi Backend for TGI.
🐛 Bug Fixes
- Fixed an issue related to Sliding Window Attention.
- Launcher now correctly gets the head dimension for VLMs.
🔧 Affected Symbols
kernelslauncher