Change8

b9581

📦 llama-cppView on GitHub →
🐛 1 fixes🔧 1 symbols

Summary

This release focuses on performance optimization within the Vulkan backend by reducing shared memory usage for matrix multiplication operations.

🐛 Bug Fixes

  • Vulkan: Reduced iq1 shared memory usage for mul_mm operation.

Affected Symbols