b8644
📦 llama-cpp
🐛 1 fix
Summary
This release reverts an earlier commit that quantized the SWA (sliding-window attention) KV cache. It also provides updated binary distributions for macOS, Linux, Windows, and openEuler, targeting various CPU and GPU backends.
🐛 Bug Fixes
- Reverted a change that quantized the SWA KV cache.