Base URL
Authentication
All endpoints (except /healthz) require authentication via one of the following methods:
| Method | Header / Parameter | Used By |
|---|---|---|
| Bearer Token | Authorization: Bearer <key> | OpenAI SDK, most clients |
| API Key Header | X-Api-Key: <key> | Anthropic SDK |
| Query Parameter | ?key=<key> | Gemini SDK |
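
As a rough illustration of the three methods in the table above, the sketch below builds the URL and headers for each one without sending anything. The `BASE_URL` value, the `build_auth` helper, and the method labels (`"bearer"`, `"x-api-key"`, `"query"`) are hypothetical names chosen for this example; only the header and parameter names come from the table.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: substitute the real base URL of your deployment.
BASE_URL = "https://gateway.example.com"


def build_auth(path: str, key: str, method: str = "bearer") -> tuple[str, dict[str, str]]:
    """Return (url, headers) for one of the three authentication methods."""
    url = BASE_URL + path
    headers: dict[str, str] = {}
    if method == "bearer":
        # Authorization: Bearer <key>
        headers["Authorization"] = f"Bearer {key}"
    elif method == "x-api-key":
        # X-Api-Key: <key>
        headers["X-Api-Key"] = key
    elif method == "query":
        # ?key=<key>, URL-encoded and appended to any existing query string
        separator = "&" if "?" in url else "?"
        url += separator + urlencode({"key": key})
    else:
        raise ValueError(f"unknown auth method: {method}")
    return url, headers
```

The returned pair can be passed to any HTTP client. Keeping the key in a header rather than the query string avoids it showing up in URL logs, so the query form is probably best reserved for clients that require it.
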
Endpoint Groups
OpenAI Compatible
/v1/chat/completions, /v1/embeddings, /v1/models
Use with any OpenAI-compatible SDK. Supports provider/model routing.
Anthropic Native
/v1/messages, /v1/messages/batches, /v1/messages/count_tokens
Drop-in compatible with the Anthropic SDK.
Gemini Native
/v1beta/models/{model}:generateContent
Drop-in compatible with the Google Gemini SDK.
Key Management
/api/v1/keys
Create, list, update, and delete API keys via the management API.
Response Format
All responses are JSON. Successful responses match the upstream provider’s format exactly. Error responses follow a consistent format:
Rate Limits
Rate limits are applied per API key. Default limits can be customized per key via the Dashboard or via key configuration in the management API.
| Header | Description |
|---|---|
| X-RateLimit-Limit | Maximum requests per window |
| X-RateLimit-Remaining | Requests remaining |
| X-RateLimit-Reset | Window reset time (Unix timestamp) |
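
A client can use these headers to decide how long to pause before its next request. The sketch below is one possible approach, not a prescribed one: the `seconds_until_reset` helper is a hypothetical name, and it assumes that the header values are plain integers and that the reset timestamp is expressed in seconds.

```python
import time
from typing import Mapping, Optional


def seconds_until_reset(headers: Mapping[str, str], now: Optional[float] = None) -> float:
    """Return how long to wait before the next request.

    Returns 0.0 when requests remain in the window or the headers are absent.
    """
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    remaining = lowered.get("x-ratelimit-remaining")
    reset = lowered.get("x-ratelimit-reset")
    if remaining is None or reset is None:
        return 0.0
    if int(remaining) > 0:
        return 0.0
    # Assumption: X-RateLimit-Reset is a Unix timestamp in seconds.
    current = time.time() if now is None else now
    return max(0.0, float(reset) - current)
```

Calling `time.sleep(seconds_until_reset(response_headers))` before the next request is a simple way to stay within the limit. The `now` parameter exists only to make the function easy to test.
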