v0.17.1-rc2
📦 ollamaView on GitHub →
✨ 3 features🐛 2 fixes
Summary
This release introduces support for the nemotron architecture and includes several performance and logging improvements, particularly for MLX-based operations. It also updates underlying MLX-C bindings.
Migration Steps
- Update mlx-c bindings to version 0.5.0.
✨ New Features
- Added support for the nemotron model architecture.
- Implemented capability-based detection for web search in the UI.
- Added upgrade configuration option to the settings page.
🐛 Bug Fixes
- Reduced log noise and fixed duplicate log prefixes in mlxrunner.
- Stopped defaulting to affine quantization for models that are not quantized.