b7593
📦 llama-cpp
✨ 3 features · 🔧 2 symbols
Summary
This release adds support for the Nvidia Music Flamingo model, including q5_k_s quantization, and extends the existing whisper implementation in the GGUF conversion script (convert_hf_to_gguf) to reduce code duplication.
✨ New Features
- Added support for the Nvidia Music Flamingo model
- Extended the whisper implementation in convert_hf_to_gguf to reduce code duplication
- Added support for q5_k_s quantization for Music Flamingo
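
As a rough illustration of how these features might be used together, the sketch below builds the two command lines for converting a Hugging Face checkpoint to GGUF and then quantizing it to Q5_K_S. It only assembles the commands and does not run them. The model directory, output directory, and file names are hypothetical placeholders. The sketch assumes the usual `convert_hf_to_gguf.py` options (`--outfile`, `--outtype`) and the `llama-quantize` positional arguments apply to Music Flamingo as they do to other models. Any extra steps an audio model may need, such as exporting a separate projector file, are not covered here.

```python
from pathlib import Path


def build_commands(model_dir, out_dir, quant="Q5_K_S"):
    """Return (convert_cmd, quantize_cmd) as argument lists.

    model_dir and out_dir are hypothetical paths chosen for illustration;
    nothing is executed here.
    """
    model_dir = Path(model_dir)
    out_dir = Path(out_dir)

    # Intermediate full-precision GGUF, then the quantized result.
    f16_path = (out_dir / f"{model_dir.name}-f16.gguf").as_posix()
    quant_path = (out_dir / f"{model_dir.name}-{quant.lower()}.gguf").as_posix()

    # Step 1: convert the Hugging Face checkpoint to a GGUF file.
    convert_cmd = [
        "python", "convert_hf_to_gguf.py", model_dir.as_posix(),
        "--outfile", f16_path,
        "--outtype", "f16",
    ]

    # Step 2: quantize the GGUF file (input, output, quantization type).
    quantize_cmd = ["llama-quantize", f16_path, quant_path, quant]

    return convert_cmd, quantize_cmd


# Hypothetical local paths, for illustration only.
convert_cmd, quantize_cmd = build_commands("models/music-flamingo", "out")
print(" ".join(convert_cmd))
print(" ".join(quantize_cmd))
```

The commands are returned as argument lists so they can be passed to `subprocess.run` without shell quoting issues, once the paths point at a real checkpoint and a local llama.cpp build.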
🔧 Affected Symbols
- convert_hf_to_gguf
- whisper