Routing, Load Balancing & Fallbacks

Router - Load Balancing

LiteLLM manages:

Beta feature. Use for testing only.

LiteLLM can automatically select the best model for a request based on rules you define.

Load balance multiple instances of the same model
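
As a rough illustration of what load balancing across instances of the same model means, the sketch below groups deployments under a shared `model_name` and picks one at random per request. The `model_name` / `litellm_params` layout follows the shape of LiteLLM's `model_list`. The deployment names and the `pick_deployment` helper are hypothetical, and uniform random choice is only an assumed strategy, not a description of LiteLLM's actual routing logic.

```python
import random
from collections import Counter

# Two deployments share the public name "gpt-3.5-turbo"; names are placeholders.
model_list = [
    {"model_name": "gpt-3.5-turbo", "litellm_params": {"model": "azure/deployment-a"}},
    {"model_name": "gpt-3.5-turbo", "litellm_params": {"model": "azure/deployment-b"}},
    {"model_name": "gpt-4", "litellm_params": {"model": "gpt-4"}},
]


def pick_deployment(model_name, deployments, rng):
    """Hypothetical helper: choose one deployment registered under model_name."""
    candidates = [d for d in deployments if d["model_name"] == model_name]
    if not candidates:
        raise ValueError(f"no deployment registered for {model_name!r}")
    return rng.choice(candidates)


# Simulate 1000 requests and count how often each deployment is chosen.
rng = random.Random(0)
picks = Counter(
    pick_deployment("gpt-3.5-turbo", model_list, rng)["litellm_params"]["model"]
    for _ in range(1000)
)
print(picks)
```

Callers only ever name the shared `model_name`; which underlying deployment serves the request is decided per call.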

LiteLLM supports setting the following budgets:

If a call still fails after num_retries retries, fall back to another model group.
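
The retry-then-fallback behavior described above can be sketched in plain Python. This is a minimal illustration, not LiteLLM's implementation: `call_with_fallbacks`, `flaky_call`, and the group names are hypothetical, and it assumes each fallback group gets the same retry allowance as the original group.

```python
from typing import Callable, Dict, List


def call_with_fallbacks(
    call: Callable[[str], str],
    model_group: str,
    fallbacks: Dict[str, List[str]],
    num_retries: int = 2,
) -> str:
    """Try model_group (1 initial attempt + num_retries), then each fallback group in order."""
    last_error = None
    for group in [model_group, *fallbacks.get(model_group, [])]:
        for _attempt in range(1 + num_retries):
            try:
                return call(group)
            except Exception as exc:  # a real client would catch narrower errors
                last_error = exc
    raise RuntimeError(f"all model groups failed for {model_group!r}") from last_error


attempts = []


def flaky_call(group: str) -> str:
    """Stand-in for a completion call: the primary group always fails."""
    attempts.append(group)
    if group == "primary-group":
        raise TimeoutError("simulated failure")
    return f"response from {group}"


result = call_with_fallbacks(
    flaky_call,
    "primary-group",
    fallbacks={"primary-group": ["backup-group"]},
    num_retries=2,
)
print(result, attempts)
```

Here the primary group is attempted three times (one call plus two retries) before the request moves to the backup group.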

Route requests based on tags.

The timeout set on the router applies to the entire length of the call and is also passed down to the completion() call.

Proxy all models from a provider