b8115
📦 llama-cppView on GitHub →
Summary
This release primarily focuses on providing pre-built binaries for various operating systems and hardware configurations, including specific CUDA versions (12.4 and 13.1) for Windows, and includes tests for matrix multiplication with huge batch sizes.