b9852
📦 llama-cppView on GitHub →
✨ 2 features🔧 1 symbols
Summary
This release introduces initial support for the q1_0 quantization format on the OpenCL backend, including specific optimizations for Adreno hardware. It also provides numerous pre-compiled binaries for diverse operating systems and hardware configurations.
✨ New Features
- Initial support for q1_0 quantization format added to OpenCL backend.
- Added Adreno GEMM/GEMV kernels for q1_0 support on OpenCL.