v0.30.0-rc31
Breaking Changes📦 ollamaView on GitHub →
⚠ 1 breaking✨ 3 features🐛 1 fixes🔧 5 symbols
Summary
This release shifts the core architecture to directly support llama.cpp and the GGUF format, introduces MLX acceleration for Apple Silicon, and fixes case handling for the nomic-embed-text model.
⚠️ Breaking Changes
- The underlying architecture has changed from using GGML to directly supporting llama.cpp, which may affect custom integrations or scripts relying on the previous internal structure.
Migration Steps
- If you rely on the previous GGML architecture, be aware that it has been replaced by direct llama.cpp support.
- Users of `nomic-embed-text` should expect inputs to be lowercased, which may require adjusting downstream processing if mixed-case input was previously relied upon.
✨ New Features
- Direct support for llama.cpp architecture.
- Compatibility with the GGUF file format.
- Use of MLX for accelerated model inference on Apple Silicon.
🐛 Bug Fixes
- The `nomic-embed-text` model now correctly converts inputs to lowercase as specified by the model card, fixing prior behavior that incorrectly preserved mixed case.