b8740
📦 llama-cppView on GitHub →
✨ 1 features
Summary
This release introduces performance improvements on CUDA devices via kernel fusion for multiplications and provides updated binary distributions for numerous platforms and hardware configurations.
✨ New Features
- Implemented CUDA kernel fusion for multiplications (#21665).