b9194

📦 llama-cpp
1 feature

Summary

This release adds a Vulkan backend optimization that fuses the SSM_CONV, BIAS, and SILU operations, which may improve performance. Pre-compiled binaries are provided for macOS, Linux, Android, Windows, and openEuler.

✨ New Features

  • Vulkan backend now fuses SSM_CONV, BIAS, and SILU operations for potential performance improvements.
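
To illustrate what fusing these three operations means, here is a minimal CPU-side sketch in plain Python. It is not the Vulkan shader from this release: the function names (`ssm_conv`, `unfused`, `fused`), the list-of-lists data layout, and the simplified model of SSM_CONV as a per-channel sliding-window dot product are all assumptions made for illustration. The point is only that the fused version computes the same result in one pass, without materializing the intermediate convolution and bias outputs.

```python
import math


def silu(v):
    # SiLU activation: v * sigmoid(v)
    return v / (1.0 + math.exp(-v))


def ssm_conv(x, w):
    # Simplified, assumed model of SSM_CONV: each channel c slides its own
    # d_conv taps w[c] over its own input row x[c] (depthwise convolution).
    # x[c] is assumed to already include the d_conv - 1 leading state values.
    d_conv = len(w[0])
    n_t = len(x[0]) - d_conv + 1
    return [
        [sum(x[c][t + k] * w[c][k] for k in range(d_conv)) for t in range(n_t)]
        for c in range(len(x))
    ]


def unfused(x, w, b):
    # Three separate passes, each producing a full intermediate result.
    y = ssm_conv(x, w)                                         # pass 1: convolution
    y = [[v + b[c] for v in row] for c, row in enumerate(y)]   # pass 2: bias add
    return [[silu(v) for v in row] for row in y]               # pass 3: activation


def fused(x, w, b):
    # One pass: convolve, add bias, and apply SiLU per output element,
    # so no intermediate arrays are written between the steps.
    d_conv = len(w[0])
    n_t = len(x[0]) - d_conv + 1
    out = []
    for c in range(len(x)):
        row = []
        for t in range(n_t):
            acc = 0.0
            for k in range(d_conv):
                acc += x[c][t + k] * w[c][k]
            row.append(silu(acc + b[c]))
        out.append(row)
    return out


# Hypothetical example data: 2 channels, 3 taps, 3 output steps per channel.
x = [[0.0, 0.0, 1.0, 2.0, 3.0], [0.0, 0.0, -1.0, 0.5, 2.0]]
w = [[0.5, -1.0, 2.0], [1.0, 1.0, 1.0]]
b = [0.1, -0.2]
print(fused(x, w, b))
```

On a GPU, the potential benefit of this kind of fusion usually comes from fewer kernel dispatches and less memory traffic for intermediates; whether that yields a measurable speedup depends on the model and hardware.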