b7649

📅 Jan 6, 2026📦 llama-cppView on GitHub →

✨ 1 features🔧 1 symbols

Summary

This release focuses on performance improvements within ggml, specifically optimizing the CUDA ssm_scan operation using warp-level reduction.

ggml