b9817
📦 llama-cppView on GitHub →
✨ 4 features🐛 2 fixes🔧 6 symbols
Summary
This release updates the OpenVINO backend to version 2026.2.1, introduces self-contained release packages, and includes several operator improvements for the OpenVINO backend.
✨ New Features
- OpenVINO backend now supports softmax with sink input.
- Optimized multiplication and matrix ID conversion process for large sizes in OpenVINO backend.
- Modified add_id operation in OpenVINO backend to support 2D/4D tensors.
- Added glu_swiglu_oai operator support in OpenVINO backend.
🐛 Bug Fixes
- Reverted accidental removal of get_op_type.
- Fixed path consistency issues during PR review.