b8740

📦 llama-cpp
1 feature

Summary

This release adds CUDA kernel fusion for multiplications, intended to improve performance on CUDA devices, and ships updated binary distributions for a range of platforms and hardware configurations.

✨ New Features

  • Implemented CUDA kernel fusion for multiplications (#21665).
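
The release note does not say which multiplications are fused or how, so the following is only a CPU-side C++ sketch of the general idea behind kernel fusion, not the code from #21665. The function names `mul_unfused` and `mul_fused` and the choice of two chained elementwise multiplications are assumptions made for illustration. The unfused version makes two passes and writes an intermediate buffer, which on a GPU would roughly correspond to two kernel launches; the fused version does the same arithmetic in one pass.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical unfused path: two elementwise passes with an intermediate
// buffer. On a GPU, each pass would typically be its own kernel launch.
std::vector<float> mul_unfused(const std::vector<float>& x,
                               const std::vector<float>& a,
                               const std::vector<float>& b) {
    assert(x.size() == a.size() && x.size() == b.size());
    std::vector<float> tmp(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        tmp[i] = x[i] * a[i];  // pass 1: write intermediate result
    }
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = tmp[i] * b[i];  // pass 2: read intermediate back
    }
    return out;
}

// Hypothetical fused path: the same arithmetic in a single pass, with no
// intermediate buffer to write and re-read.
std::vector<float> mul_fused(const std::vector<float>& x,
                             const std::vector<float>& a,
                             const std::vector<float>& b) {
    assert(x.size() == a.size() && x.size() == b.size());
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = (x[i] * a[i]) * b[i];  // same evaluation order as unfused
    }
    return out;
}
```

Both functions are meant to return the same values for the same inputs; the potential benefit of the fused form is less memory traffic and fewer launches, not a different result.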