b9286
📦 llama-cpp
Summary
This release adds Q8_0 quantization support to the ggml-zendnn backend and includes synchronization updates for that backend. Pre-compiled binaries are provided for several operating systems and hardware configurations.
✨ New Features
- Added Q8_0 quantization support for the ggml-zendnn backend.
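
For readers unfamiliar with the format, the following is a minimal Python sketch of what Q8_0 means in ggml: values are grouped into blocks of 32, and each block stores one half-precision scale plus 32 signed 8-bit integers. It illustrates the data format only and is not the ZenDNN backend's code; the function names `quantize_q8_0` and `dequantize_q8_0` are chosen here for illustration.

```python
import math
import struct

QK8_0 = 32  # number of values per Q8_0 block in ggml


def to_fp16(x: float) -> float:
    """Round a float to the nearest IEEE half-precision value (the stored scale type)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def round_half_away(x: float) -> int:
    """Round to nearest integer, ties away from zero (like C's roundf)."""
    return int(math.copysign(math.floor(abs(x) + 0.5), x))


def quantize_q8_0(values):
    """Illustrative helper: split values into blocks of 32 and return
    a list of (scale, quants) pairs, one per block."""
    if len(values) % QK8_0 != 0:
        raise ValueError("length must be a multiple of 32")
    blocks = []
    for start in range(0, len(values), QK8_0):
        chunk = values[start:start + QK8_0]
        amax = max(abs(v) for v in chunk)      # largest magnitude in the block
        d = amax / 127.0                       # scale mapping amax to +/-127
        inv_d = 1.0 / d if d != 0.0 else 0.0   # all-zero block: avoid dividing by zero
        quants = [round_half_away(v * inv_d) for v in chunk]
        blocks.append((to_fp16(d), quants))    # the scale is stored as fp16
    return blocks


def dequantize_q8_0(blocks):
    """Illustrative helper: reconstruct approximate floats from (scale, quants) blocks."""
    return [d * q for d, quants in blocks for q in quants]
```

Calling `dequantize_q8_0(quantize_q8_0(xs))` returns values close to `xs`, with a per-element error of roughly half the block scale. The sketch assumes block scales fit in half precision; very large inputs would overflow the fp16 conversion.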