litellm_sso-dev-v1.81.13

📅 Feb 20, 2026 · 📦 litellm

✨ 3 features · 🐛 29 fixes · 🔧 6 symbols

Summary

This release focuses heavily on improving test stability across various integrations (Langfuse, Vertex AI, SSO) and introduces UI enhancements for Access Groups. It also includes fixes for parameter handling in chat models and updates to managed batch functionality.

Migration Steps

The reasoning parameter is now set to None instead of Reasoning(). If your code relied on receiving a Reasoning() instance for this parameter, update it to expect None.
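
As a minimal sketch of the kind of change this implies in calling code, the snippet below treats None as "no reasoning configured" rather than expecting an empty object. The Reasoning dataclass and the describe_reasoning helper are hypothetical stand-ins for illustration; they are not LiteLLM's own types or functions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Reasoning:
    """Hypothetical stand-in for the real Reasoning type (assumption)."""
    effort: Optional[str] = None


def describe_reasoning(reasoning: Optional[Reasoning]) -> str:
    """Handle the parameter being None instead of an empty Reasoning()."""
    # Check for None explicitly instead of reading attributes off an
    # object that may no longer be there.
    if reasoning is None:
        return "reasoning not set"
    return f"reasoning effort: {reasoning.effort}"


print(describe_reasoning(None))
print(describe_reasoning(Reasoning(effort="low")))
```

Code that previously accessed attributes on the parameter unconditionally would raise AttributeError on None, so an explicit None check is the safer pattern.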

✨ New Features

Added Access Group Selector to Create and Edit Flow in the UI for Keys/Teams.
Added routing logic based on whether reasoning is supported.
Added documentation for OpenAI Agents SDK with LiteLLM.
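
The release note does not describe how the reasoning-based routing works. One plausible reading, sketched below under that assumption, is that reasoning settings are only forwarded when the target model supports them. The capability table, model names, and build_request helper are all hypothetical and do not reflect LiteLLM's actual implementation.

```python
from typing import Optional

# Hypothetical capability table; real support data would come from the
# library's own model metadata.
REASONING_CAPABLE_MODELS = {"example-reasoning-model"}


def build_request(model: str, reasoning_effort: Optional[str]) -> dict:
    """Attach reasoning settings only when the model supports them."""
    request = {"model": model}
    if reasoning_effort is not None and model in REASONING_CAPABLE_MODELS:
        request["reasoning"] = {"effort": reasoning_effort}
    # For unsupported models the key is omitted entirely, so no
    # reasoning field is sent.
    return request


print(build_request("example-reasoning-model", "low"))
print(build_request("example-plain-model", "low"))
```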

🐛 Bug Fixes

Preserved key_alias and team_id metadata in /user/daily/activity/aggregated after key deletion or regeneration.
Used None instead of Reasoning() for the reasoning parameter.
Removed unused Reasoning import from transformation.py.
Fixed Langfuse test isolation to prevent flaky failures.
Resolved a merge conflict and fixed Bedrock thinking test flakiness.
Fixed flaky tests caused by module reloading and missing mocks.
Fixed SSO test flakiness by correctly mocking premium_user.
Made policy_resolve_endpoints importable without FastAPI.
Improved Langfuse test isolation to prevent flaky failures.
Added mock isolation for test_video_content_handler_uses_get_for_openai.
Added cleanup for disable_aiohttp_transport in test_extra_body_with_fallback.
Restored Langfuse client counter in test cleanup.
Removed redundant cache flush from test_openai_env_base.
Added environment cleanup for Vertex AI rerank tests.
Updated reasoning_effort test to expect dict format.
Added environment cleanup for Vertex AI GPT-OSS tests.
Added environment cleanup for Vertex AI Qwen tests.
Used async side_effect for client.post mock in watsonx test.
Mocked vertexai module in GPT-OSS tests to prevent authentication.
Updated test_other_constraints_preserved for new schema filtering.
Added fakeredis for pod lock manager tests.
Cleared tokenizer LRU cache for test isolation.
Mocked environment variables for callback validation test.
Fixed handling of unsupported parameters for ChatGPT Codex.
Fixed the Converse Anthropic usage object to follow the v1/messages spec.
Applied miscellaneous bug fixes for Managed batches.
Clamped thinking.budget_tokens to a minimum of 1024 for Bedrock.
Fixed virtual key grace period retrieval from env/UI.
Fixed SSO PKCE support failure in multi-pod Kubernetes deployments.
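
The thinking.budget_tokens fix for Bedrock above amounts to a lower-bound clamp. The following is a minimal sketch of that idea, assuming the thinking configuration is a plain dict; the helper name clamp_thinking_budget is hypothetical and not taken from the LiteLLM codebase.

```python
MIN_BUDGET_TOKENS = 1024  # minimum stated in the release note


def clamp_thinking_budget(thinking: dict) -> dict:
    """Return a copy of `thinking` with budget_tokens raised to the minimum."""
    budget = thinking.get("budget_tokens")
    if budget is None:
        # Nothing to clamp; return an unchanged copy.
        return dict(thinking)
    return {**thinking, "budget_tokens": max(budget, MIN_BUDGET_TOKENS)}


print(clamp_thinking_budget({"type": "enabled", "budget_tokens": 100}))
print(clamp_thinking_budget({"type": "enabled", "budget_tokens": 4096}))
```

Values below 1024 are raised to 1024, while larger values and configurations without budget_tokens pass through unchanged.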

Affected Symbols

policy_resolve_endpoints
Reasoning
transformation.py
key_alias
team_id
thinking.budget_tokens