b9573

📦 llama-cpp
🐛 1 fix · 🔧 1 symbol

Summary

This release fixes a regression in the attention key/value length calculation for plamo2 models and provides updated pre-compiled binaries for various platforms.

🐛 Bug Fixes

  • Fixed a regression in plamo2 attention key/value length calculation.

Affected Symbols