Migrating to llama.cpp b9591

Version b9591 introduces 1 breaking change. This guide details how to update your code.

Released: 6/10/2026

1 breaking change, 1 migration step, 2 affected symbols

⚠️ Check Your Code

If you use any of these symbols, you need to read this guide:

ggml_gated_delta_net, ggml_cpy

Breaking Changes

Issue #1

The signature of ggml_gated_delta_net has changed. It now takes only the initial recurrent state, shaped (D, 1, n_seqs), and expects the snapshot count K as an op parameter instead of inferring K from state->ne[1].

Migration Steps

  1. If you call ggml_gated_delta_net directly, update the call to pass the snapshot count K as an op parameter instead of relying on state->ne[1] to infer it.
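
The shape convention behind this step can be sketched in isolation. The block below is only an illustration under stated assumptions: it does not call ggml, and mock_tensor, snapshot_count_before, snapshot_count_after, and migration_example are hypothetical names invented for this sketch. It also assumes the old state carried K in its second dimension, which is how this guide describes the previous inference from state->ne[1]. How K is actually supplied in b9591 (function argument or op-params field) should be checked against the ggml.h header of that release.

```c
#include <assert.h>
#include <stdint.h>

/* Stand-in for a tensor shape; this is NOT the real ggml_tensor. */
struct mock_tensor {
    int64_t ne[4]; /* ne[0] = D, ne[1] = state slots, ne[2] = n_seqs */
};

/* Previous convention as described in this guide: K is read from state->ne[1]. */
static int64_t snapshot_count_before(const struct mock_tensor *state) {
    return state->ne[1];
}

/* b9591 convention as described in this guide: the state must be
 * (D, 1, n_seqs) and the caller supplies K.
 * Returns K, or -1 if the inputs do not match that convention. */
static int64_t snapshot_count_after(const struct mock_tensor *state, int64_t k) {
    if (state->ne[1] != 1 || k < 1) {
        return -1;
    }
    return k;
}

/* Hypothetical walk-through of migrating one call site. Returns 0 on success. */
int migration_example(void) {
    struct mock_tensor old_state = { { 128, 4, 2, 1 } }; /* (D, K, n_seqs) */
    int64_t k = snapshot_count_before(&old_state);       /* K = 4, inferred */

    struct mock_tensor new_state = { { 128, 1, 2, 1 } }; /* (D, 1, n_seqs) */
    assert(snapshot_count_after(&new_state, k) == 4);     /* K passed explicitly */

    /* A multi-slot state no longer fits the new convention. */
    assert(snapshot_count_after(&old_state, k) == -1);
    return 0;
}
```

In practice this means capturing K before reshaping or slicing the state at the call site, since the state tensor no longer carries that value.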

Release Summary

This release focuses on internal optimizations: it removes padding and multiple D2D copies for MTP and updates the ggml_gated_delta_net interface. It also includes fixes for CI builds.

Need More Details?

View the full release notes and all changes for llama.cpp b9591.
