๐๏ธ Router - Load Balancing
LiteLLM manages:
๐๏ธ [BETA] Request Prioritization
Beta feature. Use for testing only.
๐๏ธ Auto Routing
LiteLLM can auto select the best model for a request based on rules you define.
๐๏ธ Proxy - Load Balancing
Load balance multiple instances of the same model
๐๏ธ Budget Routing
LiteLLM Supports setting the following budgets:
๐๏ธ Fallbacks
If a call fails after num_retries, fallback to another model group.
๐๏ธ Tag Based Routing
Route requests based on tags.
๐๏ธ Timeouts
The timeout set in router is for the entire length of the call, and is passed down to the completion() call level as well.
๐๏ธ Provider specific Wildcard routing
Proxy all models from a provider