v0.30.0-rc21

Breaking Changes

📅 May 13, 2026📦 ollamaView on GitHub →

⚠ 1 breaking✨ 3 features

Summary

This pre-release version overhauls the architecture to use llama.cpp directly, enabling GGUF support and leveraging MLX for Apple Silicon acceleration. Users are encouraged to provide feedback on performance and stability.

⚠️ Breaking Changes

The underlying architecture has changed from using GGML to directly supporting llama.cpp, which may affect custom integrations or tooling relying on the previous internal structure.

Migration Steps

If you rely on the previous GGML architecture, be aware that it has been replaced by direct llama.cpp support.
To install this specific pre-release version (0.30.0-rc21) on Mac/Linux, use: curl -fsSL https://ollama.com/install.sh | OLLAMA_VERSION=0.30.0-rc21 sh
To install this specific pre-release version (0.30.0-rc21) on Windows, use: $env:OLLAMA_VERSION="0.30.0-rc21"; irm https://ollama.com/install.ps1 | iex

✨ New Features

Direct support for llama.cpp architecture.
Compatibility with the GGUF file format is now enabled.
MLX is utilized to accelerate model inference on Apple Silicon devices.