Change8

b8115

📦 llama-cppView on GitHub →

Summary

This release primarily focuses on providing pre-built binaries for various operating systems and hardware configurations, including specific CUDA versions (12.4 and 13.1) for Windows, and includes tests for matrix multiplication with huge batch sizes.