Deploy this version​
- Docker
- Pip
docker run
-e STORE_MODEL_IN_DB=True
-p 4000:4000
ghcr.io/berriai/litellm:main-v1.67.4-stable
pip install litellm==1.67.4.post1
Key Highlights​
- Improved UI Usability and Security: This release brings usability and security improvements to the Admin UI, making it easier to manage access across 200+ users.
- Responses API Load Balancing: Route requests across provider regions and ensure session continuity.
- UI Session Logs: Group several requests to LiteLLM into a session.
New Models / Updated Models​
- OpenAI
- Added
gpt-image-1
cost tracking Get Started - Bug fix: added cost tracking for gpt-image-1 when quality is unspecified PR
- Added
- Azure
- Fixed timestamp granularities passing to whisper in Azure Get Started
- Added azure/gpt-image-1 pricing Get Started, PR
- Added cost tracking for
azure/computer-use-preview
,azure/gpt-4o-audio-preview-2024-12-17
,azure/gpt-4o-mini-audio-preview-2024-12-17
PR
- Bedrock
- Added support for all compatible Bedrock parameters when model="arn:.." (Bedrock application inference profile models) Get started, PR
- Fixed wrong system prompt transformation PR
- VertexAI / Google AI Studio
- Allow setting
budget_tokens=0
forgemini-2.5-flash
Get Started,PR - Ensure returned
usage
includes thinking token usage PR - Added cost tracking for
gemini-2.5-pro-preview-03-25
PR
- Allow setting
- Cohere
- Added support for cohere command-a-03-2025 Get Started, PR
- SageMaker
- Added support for max_completion_tokens parameter Get Started, PR
- Responses API
- Added support for GET and DELETE operations -
/v1/responses/{response_id}
Get Started - Added session management support for non-OpenAI models PR
- Added routing affinity to maintain model consistency within sessions Get Started, PR
- Added support for GET and DELETE operations -
Spend Tracking Improvements​
- Bug Fix: Fixed spend tracking bug, ensuring default litellm params aren't modified in memory PR
- Deprecation Dates: Added deprecation dates for Azure, VertexAI models PR
Management Endpoints / UI​
Users​
Filtering and Searching:
- Filter users by user_id, role, team, sso_id
- Search users by email
User Info Panel: Added a new user information pane PR
Teams​
Keys​
- Key Management:
UI Logs Page​
- Session Logs: Added UI Session Logs Get Started
UI Authentication & Security​
- Required Authentication: Authentication now required for all dashboard pages PR
- SSO Fixes: Fixed SSO user login invalid token error PR
- Encrypted Tokens: Moved UI to encrypted token usage PR
- Token Expiry: Added token expiry logic to user dashboard PR
UI General fixes​
- Fixed UI Flicker: Addressed UI flickering issues in Dashboard PR
- Improved Terminology: Better loading and no-data states on Keys and Tools pages PR
- Azure Model Support: Fixed editing Azure public model names and changing model names after creation PR
- Team Model Selector: Bug fix for team model selection PR
Logging / Guardrail Integrations​
- Datadog:
- Fixed Datadog LLM observability logging Get Started, PR
- Prometheus / Grafana:
- Enable datasource selection on LiteLLM Grafana Template Get Started, PR
- AgentOps:
- Added AgentOps Integration Get Started, PR
- Arize:
- Added missing attributes for Arize & Phoenix Integration Get Started, PR
General Proxy Improvements​
- Caching: Fixed caching to account for
thinking
orreasoning_effort
when calculating cache key PR - Model Groups: Fixed handling for cases where user sets model_group inside model_info PR
- Passthrough Endpoints: Ensured
PassthroughStandardLoggingPayload
is logged with method, URL, request/response body PR - Fix SQL Injection: Fixed potential SQL injection vulnerability in spend_management_endpoints.py PR
Helm​
- Fixed serviceAccountName on migration job PR
Full Changelog​
The complete list of changes can be found in the GitHub release notes.