b9874
📦 llama-cppView on GitHub →
✨ 1 features🔧 1 symbols
Summary
This release introduces CUDA support for concatenating quantized types and includes minor code cleanup.
✨ New Features
- Added concat implementation for quantized types on CUDA.
This release introduces CUDA support for concatenating quantized types and includes minor code cleanup.