v5.8.1
📦 transformers
🐛 4 fixes · 🔧 2 symbols
Summary
Patch release v5.8.1 fixes critical issues in the DeepSeek V4 integration and improves serving stability.
🐛 Bug Fixes
- Added fatal_error to ContinuousBatchingManager for serving stability.
- Corrected WeightConverter regex to prevent incorrect matching of shared_experts as experts.
- Fixed issues related to the DeepSeek V4 integration.
- Resolved the DeepSeek V4 CSA mask collapse issue.
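
The WeightConverter fix above concerns a common regex pitfall: an unanchored pattern for experts also matches inside shared_experts. The sketch below illustrates that pitfall in general terms. The key names and both patterns are hypothetical and chosen for illustration; they are not the actual patterns or checkpoint keys used in transformers.

```python
import re

# Hypothetical checkpoint keys, for illustration only.
keys = [
    "model.layers.0.mlp.experts.3.gate_proj.weight",
    "model.layers.0.mlp.shared_experts.gate_proj.weight",
]

# Unanchored pattern: "experts." also occurs inside "shared_experts.".
loose = re.compile(r"experts\.")

# Negative lookbehind: reject a match when "experts" is preceded by an
# identifier character, so "shared_experts" no longer qualifies.
strict = re.compile(r"(?<![A-Za-z0-9_])experts\.")

loose_hits = [k for k in keys if loose.search(k)]
strict_hits = [k for k in keys if strict.search(k)]

print(loose_hits)
print(strict_hits)
```

With the loose pattern both keys are selected, while the strict pattern selects only the routed-experts key. Anchoring on the preceding separator is one of several possible ways to get this behavior; the actual fix in the release may differ.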