ARouter routes each request to the optimal upstream provider based on model availability, provider health, and cost efficiency. This happens automatically on the server side. For most applications, you control routing with the model field, an optional models candidate list, the route field, and model suffixes such as :nitro or :floor. You do not need a separate provider configuration block in the request body for standard ARouter routing.

How Provider Selection Works

When you send a request with a model like openai/gpt-5.4, ARouter:
  1. Identifies the target provider from the model prefix
  2. Checks API key health and availability (circuit-breaker)
  3. Selects the best available key from the provider’s key pool
  4. Forwards the request, injecting the provider key transparently
If no healthy keys are available for the specified provider, ARouter returns an error with status 503 Service Unavailable.
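The selection flow above can be sketched in a few lines of Python. This is an illustrative sketch, not ARouter internals: the key-pool shape, the field names, and NoHealthyKeyError are all assumptions made for the example.

```python
class NoHealthyKeyError(Exception):
    """Corresponds to the 503 Service Unavailable response."""
    status_code = 503

def select_key(model, key_pools):
    # 1. Identify the target provider from the model prefix.
    provider, _, model_name = model.partition("/")
    # 2. Check key health (circuit-breaker state) for that provider.
    healthy = [k for k in key_pools.get(provider, []) if k["healthy"]]
    if not healthy:
        # No healthy keys: surface a 503 to the caller.
        raise NoHealthyKeyError(f"no healthy keys for {provider}")
    # 3. Select the best available key from the pool (cheapest here).
    best = min(healthy, key=lambda k: k["cost_per_token"])
    # 4. The caller forwards the request with this key injected.
    return provider, model_name, best["key"]

pools = {"openai": [
    {"key": "sk-a", "healthy": True, "cost_per_token": 2.0},
    {"key": "sk-b", "healthy": True, "cost_per_token": 1.5},
]}
print(select_key("openai/gpt-5.4", pools))  # ('openai', 'gpt-5.4', 'sk-b')
```

Requesting a provider with no healthy keys raises the error that maps to the 503 response.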

Default Strategy: Cost-Based Load Balancing

By default, ARouter balances load across healthy provider keys using a cost-aware strategy. Providers with better cost-per-token ratios are preferred. This runs entirely server-side — no configuration required.
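One common way to implement a cost-aware balancing strategy like this is inverse-cost weighted random selection; the sketch below assumes that approach for illustration and is not ARouter's actual algorithm.

```python
import random

def pick_key_cost_weighted(keys):
    """Weight each healthy key by the inverse of its cost per token,
    so cheaper providers are preferred without starving the others."""
    healthy = [k for k in keys if k["healthy"]]
    weights = [1.0 / k["cost_per_token"] for k in healthy]
    return random.choices(healthy, weights=weights, k=1)[0]

keys = [
    {"name": "provider-a", "healthy": True, "cost_per_token": 1.0},
    {"name": "provider-b", "healthy": True, "cost_per_token": 4.0},
]
# Over many requests, provider-a receives roughly 4x provider-b's traffic.
```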

Specifying a Provider via Model Prefix

The primary way to control which provider handles your request is via the provider/model format in the model field:
{
  "model": "openai/gpt-5.4",
  "messages": [{ "role": "user", "content": "Hello!" }]
}
See Model Routing for the full list of supported formats.
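A client can validate the provider/model format before sending. The helper below is a hypothetical convenience, not part of any ARouter SDK:

```python
def build_request(model, messages):
    """Build a chat request body; the model must use provider/model form."""
    provider, sep, name = model.partition("/")
    if not sep or not provider or not name:
        raise ValueError(f"expected provider/model format, got {model!r}")
    return {"model": model, "messages": messages}

body = build_request("openai/gpt-5.4", [{"role": "user", "content": "Hello!"}])
```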

Ordered Model Lists

When you want ARouter to try multiple candidate models in a specific order, send an ordered models array together with a route mode:
{
  "models": [
    "anthropic/claude-opus-4.5",
    "openai/gpt-5.4",
    "google/gemini-2.5-flash"
  ],
  "route": "fallback",
  "messages": [{ "role": "user", "content": "Hello!" }]
}
ARouter evaluates the candidates in order and returns the first successful result. See Model Routing for the full request shape.
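The fallback semantics can be sketched as a simple try-in-order loop. This is a client-side illustration of the behavior, with a caller-supplied send function standing in for the actual request:

```python
def route_fallback(models, send):
    """Try each candidate model in order; return the first success.
    `send` is a caller-supplied function that raises on provider failure."""
    last_error = None
    for model in models:
        try:
            return model, send(model)
        except Exception as exc:
            last_error = exc  # move on to the next candidate
    raise RuntimeError("all candidate models failed") from last_error

def fake_send(model):
    # Simulate the first candidate being unavailable.
    if model == "anthropic/claude-opus-4.5":
        raise ConnectionError("provider unavailable")
    return "ok"

print(route_fallback(["anthropic/claude-opus-4.5", "openai/gpt-5.4"], fake_send))
# ('openai/gpt-5.4', 'ok')
```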

Provider Variants via :nitro and :floor

Some models are offered in multiple variants. Use model suffixes to select performance tiers:

:nitro — Maximum Throughput

Append :nitro to route to the fastest available instance of a model, optimized for throughput over cost:
{
  "model": "openai/gpt-5.4:nitro",
  "messages": [{ "role": "user", "content": "Hello!" }]
}
:nitro variants are suitable for real-time applications where latency matters most.

:floor — Minimum Cost

Append :floor to route to the lowest-cost available instance:
{
  "model": "openai/gpt-5.4:floor",
  "messages": [{ "role": "user", "content": "Hello!" }]
}
:floor variants are ideal for batch processing and offline workloads.
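A server handling these model strings would split the variant suffix off before routing. A minimal sketch, assuming :nitro and :floor are the only variant suffixes in play:

```python
KNOWN_SUFFIXES = {"nitro", "floor"}  # assumed set for this sketch

def split_variant(model):
    """Split an optional :nitro/:floor suffix from a model string."""
    base, sep, suffix = model.rpartition(":")
    if sep and suffix in KNOWN_SUFFIXES:
        return base, suffix
    return model, None

print(split_variant("openai/gpt-5.4:nitro"))  # ('openai/gpt-5.4', 'nitro')
print(split_variant("openai/gpt-5.4"))        # ('openai/gpt-5.4', None)
```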

Provider Health and Availability

ARouter continuously tracks the health of each provider’s API keys using a circuit-breaker mechanism:
  • Healthy: The provider is accepting requests normally
  • Degraded: The provider has recent failures; requests may be retried with a different key
  • Unavailable: All keys for this provider are circuit-broken; ARouter returns an error
This health tracking is transparent — your application does not need to implement retry logic for provider-level failures.
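The three health states can be modeled as a minimal circuit breaker. The failure thresholds below are illustrative assumptions, not ARouter's actual values:

```python
class CircuitBreaker:
    """Minimal three-state sketch: healthy / degraded / unavailable."""

    def __init__(self, degraded_after=1, open_after=3):
        self.failures = 0
        self.degraded_after = degraded_after
        self.open_after = open_after

    @property
    def state(self):
        if self.failures >= self.open_after:
            return "unavailable"  # all requests rejected until recovery
        if self.failures >= self.degraded_after:
            return "degraded"     # requests may be retried on another key
        return "healthy"

    def record_failure(self):
        self.failures += 1

    def record_success(self):
        self.failures = 0  # a success resets the breaker

cb = CircuitBreaker()
cb.record_failure()
print(cb.state)  # degraded
```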

Using the Native Provider Proxy

For complete control, use the provider proxy endpoint /{provider}/{path} to send requests directly to a specific provider, bypassing ARouter’s model-routing layer:
# Direct to OpenAI
curl https://api.arouter.ai/openai/v1/chat/completions \
  -H "Authorization: Bearer lr_live_xxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-5.4", "messages": [...]}'

# Direct to Anthropic
curl https://api.arouter.ai/anthropic/v1/messages \
  -H "Authorization: Bearer lr_live_xxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "claude-sonnet-4.6", "messages": [...]}'
See Provider Proxy for the full reference.
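Constructing the proxy URL is a matter of joining the base URL, provider prefix, and the provider's native path. The helper below is a hypothetical convenience for illustration:

```python
BASE = "https://api.arouter.ai"

def proxy_url(provider, path):
    """Build the /{provider}/{path} provider-proxy URL."""
    return f"{BASE}/{provider}/{path.lstrip('/')}"

print(proxy_url("openai", "v1/chat/completions"))
# https://api.arouter.ai/openai/v1/chat/completions
```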

Supported Providers

Provider    Prefix       Example Model
OpenAI      openai       openai/gpt-5.4
Anthropic   anthropic    anthropic/claude-sonnet-4.6
Google      google       google/gemini-2.5-flash
DeepSeek    deepseek     deepseek/deepseek-v3.2
xAI         x-ai         x-ai/grok-4.20
Mistral     mistralai    mistralai/mistral-large-2512
Meta        meta-llama   meta-llama/llama-4-maverick
Qwen        qwen         qwen/qwen3-235b
MiniMax     minimax      minimax/minimax-m2.7
Groq        groq         groq/llama-3.3-70b-versatile
Kimi        kimi         kimi/moonshot-v2
Dashscope   dashscope    dashscope/qwen-max
See Providers for the full list with capabilities.