b8804

Breaking Changes

📅 Apr 15, 2026📦 llama-cppView on GitHub →

⚠ 1 breaking✨ 2 features

Summary

This release introduces a security enhancement by requiring explicit opt-in for CUDA P2P access and provides extensive pre-built binaries across multiple platforms and hardware accelerators.

⚠️ Breaking Changes

CUDA P2P (Peer-to-Peer) access is now disabled by default and requires explicit opt-in to enhance security and control.

Migration Steps

If your application relied on CUDA P2P access, you must now explicitly enable it.

✨ New Features

Added support for CUDA P2P access via explicit opt-in.
Provided pre-built binaries for macOS (Apple Silicon and Intel), Linux (various architectures and backends like Vulkan, ROCm 7.2, OpenVINO), Windows (CPU, CUDA 12.4, CUDA 13.1, Vulkan, SYCL, HIP), and openEuler.