v0.11.0
📦 ollama
✨ 8 features🐛 1 fixes🔧 5 symbols
Summary
Ollama v0.11 introduces support for OpenAI's gpt-oss models (20B and 120B) featuring native MXFP4 quantization, agentic capabilities, and configurable reasoning effort.
Migration Steps
- Download and install Ollama version v0.11 from the official website.
- Run 'ollama run gpt-oss:20b' or 'ollama run gpt-oss:120b' to pull the new models.
✨ New Features
- Support for OpenAI gpt-oss 20B and 120B open weight models.
- Native support for MXFP4 (4.25 bits per parameter) quantization format.
- Built-in web search tool for model augmentation.
- Support for native function calling, python tool calls, and structured outputs.
- Full chain-of-thought access for model reasoning transparency.
- Configurable reasoning effort levels (low, medium, high).
- New optimized kernels for the MXFP4 format in the Ollama engine.
- kvcache: Enable Sliding Window Attention (SWA) to retain additional entries.
🐛 Bug Fixes
- kvcache: Added logging for cache contents when a slot cannot be found.
🔧 Affected Symbols
gpt-oss:20bgpt-oss:120bMXFP4kvcacheSWA