b8875

📅 Apr 21, 2026 📦 llama-cpp

✨ 5 features 🐛 4 fixes 🔧 25 symbols

Summary

This release adds support for Reka Edge 2603: a new Yasa2 vision encoder, a dedicated chat template handler, and GGUF conversion scripts. It also includes internal cleanups, mainly around tensor handling and GGML operations.

Migration Steps

When using the chat parser, use autoparser::generation_params instead of templates_params.
Update image preprocessing settings for Reka Edge models to use image_resize_algo = RESIZE_ALGO_BICUBIC and image_resize_pad = false.

✨ New Features

Added support for Reka Edge 2603.
Added Yasa2 vision encoder support for mtmd (Reka Edge).
Added reka-edge template handler for chat (tools, thinking) using a PEG-based parser.
Added script to convert Reka Yasa2 HF checkpoints to GGUF format, including text decoder, mmproj (ConvNeXt + projection), and 2D sincos positional embeddings.
Added Reka Edge mixed quantization helper script for Q4_0 base quantization with Q8_0 override for the last 8 transformer blocks.
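
The mixed quantization rule in the last item (Q4_0 base, Q8_0 for the last 8 transformer blocks) can be sketched as a per-tensor type selection. This is a minimal illustration, not the helper script itself: the function names `pick_quant_type` and `build_overrides` are hypothetical, and it assumes block tensors use the GGUF-style `blk.<index>.` name prefix and that non-block tensors simply take the base type.

```python
import re

# Assumption: transformer-block tensors are named with a "blk.<index>." prefix.
_BLOCK_RE = re.compile(r"^blk\.(\d+)\.")


def pick_quant_type(tensor_name, n_blocks, last_n=8, base="Q4_0", override="Q8_0"):
    """Return the quantization type for one tensor (hypothetical helper)."""
    match = _BLOCK_RE.match(tensor_name)
    if match is None:
        # Not a transformer-block tensor: keep the base type (an assumption).
        return base
    block_index = int(match.group(1))
    # Blocks n_blocks - last_n .. n_blocks - 1 get the higher-precision type.
    return override if block_index >= n_blocks - last_n else base


def build_overrides(tensor_names, n_blocks, last_n=8):
    """Map every tensor name to its chosen quantization type."""
    return {name: pick_quant_type(name, n_blocks, last_n) for name in tensor_names}
```

For a hypothetical 32-block model, `blk.23.attn_q.weight` would stay at Q4_0 while `blk.24.attn_q.weight` through `blk.31.*` would be assigned Q8_0.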

🐛 Bug Fixes

Fixed stray text appended in llama_decode_text by removing accidental concatenation of the full text string when formatting UNK_BYTE hex escapes.
Avoided duplicate tensor loading in the yasa2 vision encoder where TN_YASA_PATCH_W and TN_PATCH_EMBD resolved to the same tensor name, preventing memory pool overflow.
Simplified reasoning parser in chat-reka to match LFM2 pattern.
Removed image/video oracle tests due to incompatibility with oaicompat parser.
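
The duplicate tensor loading fix above can be illustrated with a toy model of the failure mode. This is a sketch under stated assumptions, not the llama.cpp code: `TensorPool` and `load_unique` are hypothetical names, the pool is assumed to be sized for the number of unique tensors, and the string given to the two constants is a placeholder value chosen for illustration.

```python
class TensorPool:
    """Toy fixed-capacity pool standing in for a pre-sized tensor buffer."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.loaded = []

    def load(self, name):
        # Every load consumes one slot, even if the name was already loaded.
        if len(self.loaded) >= self.capacity:
            raise MemoryError(f"pool overflow while loading {name!r}")
        self.loaded.append(name)


def load_unique(pool, requested_names):
    """Load each distinct tensor name once, keeping first-seen order."""
    # dict.fromkeys drops repeated names while preserving order.
    for name in dict.fromkeys(requested_names):
        pool.load(name)
    return pool.loaded


# Placeholder value: both constants resolve to the same name (assumed string).
TN_PATCH_EMBD = "v.patch_embd.weight"
TN_YASA_PATCH_W = "v.patch_embd.weight"

# "v.example.weight" is a made-up second tensor for the demonstration.
requested = [TN_PATCH_EMBD, TN_YASA_PATCH_W, "v.example.weight"]
pool = TensorPool(capacity=len(set(requested)))
load_unique(pool, requested)
```

Loading `requested` name by name without deduplication would ask this pool for three slots when it only has two, which is the overflow the fix avoids.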
