Change8

b7593

📦 llama-cppView on GitHub →
3 features🔧 2 symbols

Summary

This release introduces support for the Nvidia Music Flamingo model and optimizes the GGUF conversion script by extending the whisper implementation.

✨ New Features

  • Added support for Nvidia Music Flamingo Model
  • Extended whisper implementation in hf_to_gguf to reduce code duplication
  • Added support for q5_k_s quantization for Music Flamingo

🔧 Affected Symbols

convert_hf_to_ggufwhisper