v0.30.9
📦 ollama
✨ 1 feature 🐛 2 fixes 🔧 1 symbol
Summary
This release adds support for the Cohere2Moe architecture and fixes two bugs: LFM2 parsing and rendering when no thinking is emitted, and coding agent or assistant use cases such as `ollama launch claude` outputting only one token. Ollama now enforces context window limits on single messages.
✨ New Features
- Support for Cohere2Moe architecture
🐛 Bug Fixes
- Fixed LFM2 parser/render for cases where thinking was not emitted
- Fixed issue where `ollama launch claude` and other coding agent or assistant use cases would only output one token