b7632

📅 Jan 5, 2026 📦 llama-cpp

🐛 3 fixes 🔧 4 symbols

Summary

This release contains three bug fixes, primarily in the Vulkan backend: quantize_q8_1 exceeding the maximum workgroup count, small-tile matrix multiplication on Lavapipe, and mul_mat_id failures.

🐛 Bug Fixes

Handled quantize_q8_1 exceeding the maximum workgroup count in the Vulkan backend.
Fixed matrix multiplication with small tile sizes on Lavapipe.
Fixed mul_mat_id failures.
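
For context on the first fix: Vulkan limits how many workgroups one dispatch may launch per dimension (the `maxComputeWorkGroupCount` device limit, which the specification only guarantees to be at least 65535). A common way to stay under such a limit is to split a large dispatch into several smaller ones. The sketch below illustrates that general idea only; it is not the code from this release, and `DispatchChunk` and `split_dispatch` are hypothetical names that do not exist in llama.cpp.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical type (not part of llama.cpp): one piece of a split dispatch.
struct DispatchChunk {
    uint64_t first_group;  // index of the first workgroup covered by this chunk
    uint32_t group_count;  // number of workgroups to launch for this chunk
};

// Hypothetical helper: split `total_groups` workgroups into chunks that each
// stay within `max_groups`, the per-dimension device limit.
std::vector<DispatchChunk> split_dispatch(uint64_t total_groups, uint32_t max_groups) {
    assert(max_groups > 0);  // a zero limit would never make progress
    std::vector<DispatchChunk> chunks;
    for (uint64_t first = 0; first < total_groups; first += max_groups) {
        // The last chunk may be smaller than the limit.
        uint64_t count = std::min<uint64_t>(max_groups, total_groups - first);
        chunks.push_back({first, static_cast<uint32_t>(count)});
    }
    return chunks;
}
```

Each chunk would then be issued as its own dispatch, with `first_group` passed to the shader (for example as a push constant) so it can offset its workgroup index. How the actual fix handles this may differ.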

🔧 Affected Symbols

vulkan, quantize_q8_1, matmul, mul_mat_id