v1.81.9-stable
📦 litellmView on GitHub →
✨ 17 features🐛 29 fixes🔧 15 symbols
Summary
This release focuses heavily on bug fixes across various providers (including Anthropic, Vertex AI, and GigaChat) and significant enhancements to the UI, including new budget management features and improved admin panels. It also introduces support for new models like Qwen3-235B and Claude Opus 4.6.
Migration Steps
- If using A2A agents deployed with localhost/internal URLs in agent cards (e.g., http://0.0.0.0:8001/), review the fix applied in PR #20604.
✨ New Features
- Added faster linting targets for development workflow.
- UI: Show Config Defined Search Tools.
- UI: Add support for MCP Semantic Filtering.
- OpenRouter: Added Qwen3-235B models.
- Support TTL(1h) field in prompt caching for Bedrock Claude 4.5 models.
- Added CLI arguments for RDS IAM auth.
- [Feat] add `claude-opus-4-6` to model cost map.
- Added Claude Opus 4.6 support.
- UI: Add soft_budget field to Team Table + Create/Update Endpoints.
- Web Search: Added gpt-5-search-api model and documentation clarifications.
- Added ElevenLabs eleven_v3 and eleven_multilingual_v2 to model cost map.
- Team Soft Budget Email Alerts.
- UI: Admin Settings: Add option for Authentication for public AI Hub.
- Added full support for Opus 4.6 (Anthropic, Azure AI, Bedrock, Vertex AI).
- Added INFO-level session reuse logging per request.
- MCP Gateway: Allow setting MCP Servers as Private/Public available on Internet.
- Added IP-Based Access Control feature.
🐛 Bug Fixes
- Fixed search tools not being found when using per-request routers.
- Fixed Langfuse OpenTelemetry trace issues.
- Preserved streaming content on guardrail-sampled chunks.
- Fixed Unique Constraint on Daily Tables + Logging When Updates Fail.
- Fixed mypy regression: TypedDict key error in fireworks_ai transformation.
- Fixed inconsistent response format in anthropic.messages.acreate() when using non-Anthropic providers.
- Fixed lint: removed unused Any/cast imports in github_copilot transformation.
- Disabled merging of consecutive user messages for GigaChat provider.
- Fixed Vertex AI Gemini streaming content_filter handling.
- Fixed 404 Not Found on /api/event_logging/batch endpoint.
- Fixed UI daily spend date filtering for user timezone.
- Fixed Non Root Dockerfile: Kept package-lock.json.
- Fixed test isolation for test_watsonx_gpt_oss_prompt_transformation.
- Fixed test isolation for test_log_langfuse_v2_handles_null_usage_values.
- Guardrails API: Ensured OpenAI Moderations Guard works with OpenAI Embeddings.
- Fixed gcs_bucket_name passing issue.
- Fixed array type checks for model, agent, and MCP hub data.
- Aligned Claude Opus 4.6 metadata and limits.
- Added unsupported claude code beta headers in json.
- Fixed Prometheus budget metrics parallelization, caching bug, and reduced CPU usage.
- Warned when budget lookup fails; cache won't populate.
- Fixed UI Model Info Page: Input and Output Labels.
- Fixed UI Model Page: Column Resizing on Smaller Screens.
- Fixed MCP: resolve OAuth2 'Capabilities: none' bug for upstream MCP servers.
- A2a Agent Gateway Fixes: Addressed issue with A2A agents deployed with localhost/internal URLs in agent cards.
- Re-issued fix for Keys and Teams Router Setting + Allowed Override of Router Settings.
- Fixed SSO: extracted user roles from JWT access token for Keycloak compatibility.
- Fixed mypy: resolved missing return statements and type casting issues.
- Added HTTP support to custom code guardrails + Unified guardrails for MCP + Agent guardrail support.
Affected Symbols
fireworks_ai transformationgithub_copilot transformationanthropic.messages.acreate()Vertex AI Gemini/api/event_logging/batchBedrock Claude 4.5GigaChat providerOpenAI Embeddingsgcs_bucket_namewatsonx_gpt_oss_prompt_transformationlog_langfuse_v2_handles_null_usage_valuesprometheus metricsKeycloak SSOMCP GatewayA2A Agent Gateway