v0.30.0-rc31

Breaking Changes

📅 May 13, 2026 · 📦 ollama

⚠ 1 breaking · ✨ 3 features · 🐛 1 fix · 🔧 5 symbols

Summary

This release shifts the core architecture to directly support llama.cpp and the GGUF format, introduces MLX acceleration for Apple Silicon, and fixes case handling for the nomic-embed-text model.

⚠️ Breaking Changes

The underlying architecture has changed from using GGML to directly supporting llama.cpp, which may affect custom integrations or scripts relying on the previous internal structure.

Migration Steps

Review any custom integrations or scripts that depend on the previous GGML-based internals; those internals have been replaced by direct llama.cpp support.
Users of `nomic-embed-text` should expect inputs to be lowercased before embedding. If downstream processing relied on mixed-case input being preserved, adjust it accordingly.
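
For the `nomic-embed-text` change, one place downstream code may need adjusting is anywhere embeddings are cached or deduplicated by their input text, since inputs that differ only in case should now be treated as the same input. The sketch below is illustrative only: `normalize_for_embedding` and `cache_key` are hypothetical helpers, not part of ollama, and it assumes plain `str.lower()` is a close enough match for the model-side lowercasing.

```python
def normalize_for_embedding(text: str) -> str:
    """Lowercase text client-side (assumption: str.lower() approximates the model-side step)."""
    return text.lower()


def cache_key(model: str, text: str) -> tuple[str, str]:
    """Build a cache key for an embedding request (hypothetical helper)."""
    # Strip an optional tag such as ":latest" before comparing model names.
    base_name = model.split(":")[0]
    # Fold case only for the model whose inputs are now lowercased.
    if base_name == "nomic-embed-text":
        text = normalize_for_embedding(text)
    return (model, text)
```

With this in place, `cache_key("nomic-embed-text", "Hello World")` and `cache_key("nomic-embed-text", "hello world")` produce the same key, while keys for other models keep their original casing.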

✨ New Features

Direct support for llama.cpp architecture.
Compatibility with the GGUF file format.
Use of MLX for accelerated model inference on Apple Silicon.
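
When auditing local model files against the GGUF item above, a quick header check can tell GGUF files apart from older formats. GGUF files begin with the four ASCII bytes `GGUF` followed by a little-endian 32-bit version number. The sketch below reads only those eight bytes; `read_gguf_version` is a hypothetical helper name, and the check does not validate the rest of the file.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_version(path):
    """Return the GGUF version number, or None if the file lacks a GGUF header."""
    with open(path, "rb") as f:
        header = f.read(8)
    # A valid header needs the 4-byte magic plus a 4-byte version field.
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack("<I", header[4:8])  # little-endian uint32
    return version
```

A return value of `None` means the file does not start with a GGUF header, for example because it is in an older GGML-era format or is not a model file at all.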

🐛 Bug Fixes

The `nomic-embed-text` model now correctly converts inputs to lowercase as specified by the model card, fixing prior behavior that incorrectly preserved mixed case.

Affected Symbols

llama.cpp, GGML, GGUF, MLX, nomic-embed-text