Change8

b9852

📦 llama-cppView on GitHub →
2 features🔧 1 symbols

Summary

This release introduces initial support for the q1_0 quantization format on the OpenCL backend, including specific optimizations for Adreno hardware. It also provides numerous pre-compiled binaries for diverse operating systems and hardware configurations.

✨ New Features

  • Initial support for q1_0 quantization format added to OpenCL backend.
  • Added Adreno GEMM/GEMV kernels for q1_0 support on OpenCL.

Affected Symbols