Change8

b8785

📦 llama-cppView on GitHub →
1 features🔧 4 symbols

Summary

This release introduces NVFP4 support within the Vulkan backend for several core tensor operations. It also provides a comprehensive set of pre-compiled binaries for diverse operating systems and hardware accelerators.

✨ New Features

  • Added support for GGML_TYPE_NVFP4 in Vulkan backend for get_rows, dequant, and mul_mat(_id) operations.

Affected Symbols