b8507
📦 llama-cpp
✨ 1 feature 🔧 1 symbol
Summary
This release re-enables graph reuse in the ggml backend when pipeline parallelism is active. It also ships updated binary distributions for numerous operating systems and hardware configurations.
✨ New Features
- Re-enabled graph reuse when using pipeline parallelism in the ggml backend.