v0.10.0

Breaking Changes

📅 Jul 18, 2025📦 ollama

⚠ 1 breaking✨ 5 features🐛 3 fixes🔧 7 symbols

Summary

Ollama v0.10.0 introduces a new desktop app, significant performance optimizations for gemma3n and multi-GPU setups, and critical fixes for tool calling and API image support.

⚠️ Breaking Changes

Parallel request processing now defaults to 1. This may impact throughput for concurrent users unless manually reconfigured.

Migration Steps

If your workflow relies on high concurrency, review the parallel request settings as the default has changed to 1.
Download the new app version for macOS or Windows from the official site.

✨ New Features

Added context length display to the 'ollama ps' command.
Improved performance for gemma3n models by 2-3x.
Improved multi-GPU performance by 10-30%.
OpenAI-compatible API now supports WebP images.
Enhanced error display in 'ollama run' for better user feedback.

🐛 Bug Fixes

Fixed tool calling issues with granite3.3 and mistral-nemo models.
Fixed tool calling logic error where overlapping tool names (e.g., 'add' vs 'get_address') caused failures.
Fixed an issue where 'ollama show' would incorrectly report an error.

🔧 Affected Symbols

ollama psollama showollama rungemma3ngranite3.3mistral-nemoOpenAI-compatible API