v0.13.0

📅 Nov 19, 2025📦 ollamaView on GitHub →

✨ 7 features🐛 2 fixes🔧 7 symbols

Summary

This release introduces support for DeepSeek-OCR, Cogito-V2.1, and DeepSeek-V3.1 architecture, alongside a new performance benchmarking tool and significant engine optimizations for KV caching and GPU detection.

Migration Steps

To use the new bench tool, install Go and run 'go run ./cmd/bench -model <model_name>' from the root of the Ollama repository.

✨ New Features

Added support for DeepSeek-OCR model for high-precision document processing and optical 2D mapping.
Added support for Cogito-V2.1 instruction-tuned generative models.
Added support for DeepSeek-V3.1 architecture in the Ollama engine.
Introduced a new 'bench' tool in the GitHub repository for testing model performance.
Improved KV cache performance to eliminate the need for defragmentation.
Improved CPU and memory detection within containers and cgroups.
Improved VRAM information detection for AMD GPUs.

🐛 Bug Fixes

Fixed performance regressions on CUDA introduced in version 0.12.11.
Fixed missing Vulkan libraries in Linux installation packages.

🔧 Affected Symbols

DeepSeek-OCRCogito-V2.1DeepSeek-V3.1cmd/benchCUDAVulkanKV cache