b9573
📦 llama-cppView on GitHub →
🐛 1 fixes🔧 1 symbols
Summary
This release primarily addresses a bug fix related to attention key/value length in plamo2 models and provides updated pre-compiled binaries for various platforms.
🐛 Bug Fixes
- Fixed a regression in plamo2 attention key/value length calculation.