b9536

📅 Jun 5, 2026 · 📦 llama-cpp

✨ 5 features · 🔧 4 symbols

Summary

This release improves performance in the OpenCL backend for the get_rows, cpy, and concat operations and for the flat q6_K gemv kernel. Several platform-specific builds were disabled.

✨ New Features

OpenCL: Improved performance for get_rows, cpy, concat, and flat q6_K gemv operations.
OpenCL: Enabled support for multiple workgroups for large rows.
OpenCL: Improved performance for small copy operations (cpy).
OpenCL: Implemented packed concatenation for small inputs.
OpenCL: Tweaked flat q6_K gemv by increasing N_DST and remapping threads.
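
To illustrate what "multiple workgroups for large rows" can mean, here is a minimal Python sketch of the idea. It is not the llama.cpp kernel: the release note does not say which operation the change applies to, so the sketch assumes a get_rows-style gather, and the names `plan_row_copy`, `get_rows`, and the `chunk` size of 1024 are hypothetical. Each (start, end) pair stands in for the slice of a row that one workgroup would handle.

```python
from math import ceil


def plan_row_copy(row_len, chunk=1024):
    """Split one row into (start, end) slices, one per hypothetical workgroup.

    A short row yields a single slice; a long row yields several, so the
    work for that row can be spread over more than one workgroup.
    """
    if row_len <= 0:
        return []
    n_groups = ceil(row_len / chunk)
    return [(g * chunk, min((g + 1) * chunk, row_len)) for g in range(n_groups)]


def get_rows(src, indices, chunk=1024):
    """Reference gather: copy src[idx] for each idx, slice by slice.

    The inner loop runs sequentially here; on a GPU each slice would be
    handled by a separate workgroup (assumption for illustration only).
    """
    out = []
    for idx in indices:
        row = src[idx]
        dst = [None] * len(row)
        for start, end in plan_row_copy(len(row), chunk):
            dst[start:end] = row[start:end]
        out.append(dst)
    return out
```

With `chunk=1024`, a 2500-element row is covered by three slices, while a 10-element row still uses a single one, so small inputs pay no extra scheduling cost under this scheme.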

Affected Symbols

opencl:get_rows opencl:cpy opencl:concat opencl:q6_k_flat_gemv