b8191

📅 Mar 3, 2026📦 llama-cppView on GitHub →

✨ 1 features🐛 1 fixes🔧 1 symbols

Summary

This release introduces an optimized Q4_1 OpenCL kernel for Adreno GPUs and includes various refactoring and formatting changes within the OpenCL implementation. It also fixes a build issue for non-Adreno OpenCL targets.

✨ New Features

Added optimized Q4_1 matrix multiplication kernel for Adreno GPUs via OpenCL.

🐛 Bug Fixes

Fixed OpenCL build for non-Adreno devices.

Affected Symbols

ggml_cl_mul_mat_q4_1_f32_adreno