Uptime Optimization

ARouter continuously monitors the health and availability of all upstream AI providers to ensure maximum uptime for your applications.

How It Works

ARouter tracks response times, error rates, and availability across all providers in real time. This data drives intelligent routing decisions and helps surface reliability information in your Activity feed. When a provider experiences degraded performance or an outage, ARouter automatically adjusts routing weights to deprioritize that provider — without any change required on your side.

What ARouter Monitors

For each provider and model, ARouter continuously tracks:

Success rate: Percentage of requests that complete without error
Time to first token (TTFT): Latency from request submission to first streaming token
Total response time: End-to-end latency for non-streaming responses
Error types: Distinguishes between transient errors (5xx, rate limits) and permanent errors (invalid model, bad request)

Automatic Routing Around Outages

When ARouter detects that a provider is degraded:

The provider’s routing weight is reduced or zeroed temporarily
Subsequent requests are routed to other healthy providers serving the same model family
The provider is re-evaluated periodically and reintroduced once health metrics recover

This happens transparently — your requests continue to succeed even during provider incidents.

Customizing for Higher Availability

Use Ordered Candidate Model Lists

For critical workloads, specify an ordered list of models. ARouter tries each in sequence until one succeeds:

{
  "models": [
    "anthropic/claude-sonnet-4-6",
    "openai/gpt-5.4",
    "google/gemini-2.5-pro"
  ],
  "route": "fallback"
}

This ensures your application keeps working even if the primary model’s providers are fully unavailable. See Model Routing for the full configuration reference.

Use Auto Routing

Set model: "auto" to let ARouter dynamically select the best available model based on current provider health, cost, and capability:

{ "model": "auto" }

See Model Routing — Auto Routing.

Use `:floor` for Cost-Stable Routing

The :floor suffix routes to the lowest-cost provider serving a model, which is often a different provider than the default — providing natural diversity:

{ "model": "anthropic/claude-sonnet-4-6:floor" }

Monitoring Provider Health

View real-time provider health data in the ARouter Dashboard. The dashboard shows per-model availability and latency trends so you can make informed decisions about model selection for your workloads.

Get Started

Core Concepts

Features

Guides

Privacy

Administration

Best Practices

Frameworks & Integrations

For Providers

Support

Uptime Optimization

How It Works

What ARouter Monitors

Automatic Routing Around Outages

Customizing for Higher Availability

Use Ordered Candidate Model Lists

Use Auto Routing

Use `:floor` for Cost-Stable Routing

Monitoring Provider Health

Get Started

Core Concepts

Features

Guides

Privacy

Administration

Best Practices

Frameworks & Integrations

For Providers

Support

​How It Works

​What ARouter Monitors

​Automatic Routing Around Outages

​Customizing for Higher Availability

​Use Ordered Candidate Model Lists

​Use Auto Routing

​Use :floor for Cost-Stable Routing

​Monitoring Provider Health

How It Works

What ARouter Monitors

Automatic Routing Around Outages

Customizing for Higher Availability

Use Ordered Candidate Model Lists

Use Auto Routing

Use `:floor` for Cost-Stable Routing

Monitoring Provider Health