v0.30.8
📦 ollamaView on GitHub →
✨ 1 features🐛 2 fixes🔧 1 symbols
Summary
This release includes improvements to MLX MTP caching, hardening of mlxrunner layers, and a fix for launch provider drift. Prompt caching logic has also been decoupled from context shifting.
✨ New Features
- Decoupled prompt caching from context shift.
🐛 Bug Fixes
- Hardened linear/embedding layers in mlxrunner against over-promotion.
- Fixed launch provider drift.