b9581
📦 llama-cpp
🐛 1 fix 🔧 1 symbol
Summary
This release optimizes the Vulkan backend by reducing the shared memory that the mul_mm (matrix multiplication) operation uses for iq1.
🐛 Bug Fixes
- Vulkan: Reduced iq1 shared memory usage for mul_mm operation.