v0.30.0-rc20

📅 May 13, 2026📦 ollamaView on GitHub →

✨ 3 features

Summary

This pre-release updates the architecture to directly support llama.cpp and the GGUF format, while introducing MLX acceleration for Apple Silicon inference.

Migration Steps

When installing on Mac/Linux, use OLLAMA_VERSION=0.30.0-rc20 in the install script.
When installing on Windows, set $env:OLLAMA_VERSION="0.30.0-rc20" before running the install script.

✨ New Features

Architecture changed to directly support llama.cpp instead of building on top of GGML.
Added compatibility with GGUF file format.
MLX is now used to accelerate model inference on Apple Silicon.