Skip to content

Commit e761665

Browse files
committed
docs fix
1 parent 133189e commit e761665

File tree

3 files changed

+27
-22
lines changed

3 files changed

+27
-22
lines changed
134 KB
Loading

docs/my-website/release_notes/v1.78.0-stable/index.md

Lines changed: 26 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -59,23 +59,45 @@ pip install litellm==1.78.0.rc.1
5959
## Key Highlights
6060

6161
- **MCP Gateway - Control Tool Access by Team, Key** - Control MCP tool access by team/key.
62+
- **Performance Improvements** - 70% Lower p99 Latency
6263
- **GPT-5 Pro & GPT-Image-1-Mini** - Day 0 support for OpenAI's GPT-5 Pro (400K context) and gpt-image-1-mini image generation
63-
- **UI Performance Boost** - Replaces bloated key list calls with lean key aliases endpoint, Turbopack for faster development, and major UI refactors
6464
- **EnkryptAI Guardrails** - New guardrail integration for content moderation
6565
- **Tag-Based Budgets** - Support for setting budgets based on request tags
66-
- **Azure AD & SSO** - Enhanced Azure AD default credentials selection and EntraID app roles support
6766

6867
---
6968

70-
## Performance
69+
### MCP Gateway - Control Tool Access by Team, Key
70+
71+
<Image
72+
img={require('../../img/release_notes/tool_control.png')}
73+
style={{width: '100%', display: 'block', margin: '2rem auto'}}
74+
/>
75+
76+
<br/>
77+
78+
Proxy admins can now control MCP tool access by team or key. This makes it easy to grant different teams selective access to tools from the same MCP server.
79+
80+
For example, you can now give your Engineering team access to `list_repositories`, `create_issue`, and `search_code` tools, while Sales only gets `search_code` and `close_issue` tools.
81+
82+
This makes it easier for Proxy Admins to govern MCP Tool Access.
83+
84+
[Get Started](../../docs/mcp_control#set-allowed-tools-for-a-key-team-or-organization)
85+
86+
---
87+
88+
## Performance - 70% Lower p99 Latency
89+
90+
<Image img={require('../../img/release_notes/1_78_0_perf.png')} style={{ width: '800px', height: 'auto' }} />
91+
92+
<br/>
7193

7294
This release cuts p99 latency by 70% on LiteLLM AI Gateway, making it even better for low-latency use cases.
7395

7496
These gains come from two key enhancements:
7597

7698
**Reliable Sessions**
7799

78-
`shared_session` is now consistently used across all calls, resolving JSON errors, enabling connection pooling, and actively managing session lifecycles—detecting stale connections, retrying expired sessions, and preventing 'Session is closed' errors.
100+
Added support for shared sessions with aiohttp. The shared_session parameter is now consistently used across all calls, enabling connection pooling.
79101

80102
**Faster Routing**
81103

@@ -112,23 +134,6 @@ View the complete configuration: [gist.github.com/AlexsanderHamir/config.yaml](
112134
View the complete load testing script: [gist.github.com/AlexsanderHamir/no_cache_hits.py](https://gist.github.com/AlexsanderHamir/42c33d7a4dc7a57f56a78b560dee3a42)
113135

114136
---
115-
### MCP Gateway - Control Tool Access by Team, Key
116-
117-
<Image
118-
img={require('../../img/release_notes/tool_control.png')}
119-
style={{width: '100%', display: 'block', margin: '2rem auto'}}
120-
/>
121-
122-
<br/>
123-
124-
Proxy admins can now control MCP tool access by team or key. This makes it easy to grant different teams selective access to tools from the same MCP server.
125-
126-
For example, you can now give your Engineering team access to `list_repositories`, `create_issue`, and `search_code` tools, while Sales only gets `search_code` and `close_issue` tools.
127-
128-
This makes it easier for Proxy Admins to govern MCP Tool Access.
129-
130-
[Get Started](../../docs/mcp_control#set-allowed-tools-for-a-key-team-or-organization)
131-
132137

133138
## New Models / Updated Models
134139

litellm/model_prices_and_context_window_backup.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4841,7 +4841,7 @@
48414841
"input_cost_per_token_above_200k_tokens": 6e-06,
48424842
"litellm_provider": "anthropic",
48434843
"max_input_tokens": 1000000,
4844-
"max_output_tokens": 1000000,
4844+
"max_output_tokens": 64000,
48454845
"max_tokens": 1000000,
48464846
"mode": "chat",
48474847
"output_cost_per_token": 1.5e-05,

0 commit comments

Comments
 (0)