Change8

b8644

📦 llama-cppView on GitHub →
🐛 1 fixes

Summary

This release reverts a previous commit related to SWA KV cache quantization and provides updated binary distributions for macOS, Linux, Windows, and openEuler targeting various CPU/GPU backends.

🐛 Bug Fixes

  • Reverted a change that quantized the SWA KV cache.