v2.29.0

Breaking Changes

📅 May 12, 2025📦 localaiView on GitHub →

⚠ 3 breaking✨ 6 features🔧 1 symbols

Summary

LocalAI v2.29.0 introduces a major overhaul to container images, slimming down defaults and introducing an \`-extras\` suffix for optional dependencies, alongside new model support and experimental video generation capabilities.

⚠️ Breaking Changes

Container images containing extra Python dependencies (like those for `diffusers`) now require the \`-extras\` suffix (e.g., \`latest-gpu-nvidia-cuda-12-extras\`). Default images are now slimmer and do not include these dependencies.
The separate \`-ffmpeg\` container image tags have been removed because FFmpeg is now included in all core images. Users should switch to the corresponding base image tag (e.g., \`latest-gpu-hipblas-ffmpeg\` becomes \`latest-gpu-hipblas\`).
The AutoGPTQ backend has been dropped because it was discontinued upstream.

Migration Steps

If you relied on extra Python dependencies (like diffusers) in your container, update your image pull command to use the new \`-extras\` suffix (e.g., change \`latest-gpu-nvidia-cuda-12\` to \`latest-gpu-nvidia-cuda-12-extras\` if you need those dependencies).
If you were using an image tag ending in \`-ffmpeg\`, remove \`-ffmpeg\` from the tag (e.g., change \`latest-gpu-hipblas-ffmpeg\` to \`latest-gpu-hipblas\`).
If you were using the AutoGPTQ backend, you must switch to an alternative backend as it has been removed.

✨ New Features

Official support for the Qwen3 model family has been integrated.
Experimental Auto GPU Offload feature added, attempting to automatically detect GPUs and configure optimal layer offloading for llama.cpp and CLIP.
Whisper.cpp now supports GPU acceleration via cuBLAS (NVIDIA) and Vulkan.
Introduction of an experimental \`/video/generations\` endpoint.
The \`install.sh\` script now includes a \`--uninstall\` flag for easy removal.
Expanded Hipblas targets to include: \`gfx803,gfx900,gfx906,gfx908,gfx90a,gfx942,gfx1010,gfx1030,gfx1032,gfx1100,gfx1101,gfx1102\`.

🔧 Affected Symbols

AutoGPTQ backend