Change8

v0.30.0-rc31

Breaking Changes
📦 ollamaView on GitHub →
1 breaking3 features🐛 1 fixes🔧 5 symbols

Summary

This release shifts the core architecture to directly support llama.cpp and the GGUF format, introduces MLX acceleration for Apple Silicon, and fixes case handling for the nomic-embed-text model.

⚠️ Breaking Changes

  • The underlying architecture has changed from using GGML to directly supporting llama.cpp, which may affect custom integrations or scripts relying on the previous internal structure.

Migration Steps

  1. If you rely on the previous GGML architecture, be aware that it has been replaced by direct llama.cpp support.
  2. Users of `nomic-embed-text` should expect inputs to be lowercased, which may require adjusting downstream processing if mixed-case input was previously relied upon.

✨ New Features

  • Direct support for llama.cpp architecture.
  • Compatibility with the GGUF file format.
  • Use of MLX for accelerated model inference on Apple Silicon.

🐛 Bug Fixes

  • The `nomic-embed-text` model now correctly converts inputs to lowercase as specified by the model card, fixing prior behavior that incorrectly preserved mixed case.

Affected Symbols