b8500
📦 llama-cpp
✨ 1 feature | 🔧 1 symbol
Summary
This release adds Flash Attention (FA) kernel instantiations for K and V head sizes of 512 (HSK=512, HSV=512) in the Metal backend and ships updated prebuilt binaries for the supported platforms.
✨ New Features
- Added FA instantiations for HSK=512 and HSV=512 in the Metal backend.