v0.30.0-rc20
📦 ollamaView on GitHub →
✨ 3 features
Summary
This pre-release updates the architecture to directly support llama.cpp and the GGUF format, while introducing MLX acceleration for Apple Silicon inference.
Migration Steps
- When installing on Mac/Linux, use OLLAMA_VERSION=0.30.0-rc20 in the install script.
- When installing on Windows, set $env:OLLAMA_VERSION="0.30.0-rc20" before running the install script.
✨ New Features
- Architecture changed to directly support llama.cpp instead of building on top of GGML.
- Added compatibility with GGUF file format.
- MLX is now used to accelerate model inference on Apple Silicon.