API Reference
The full Routero AI API is documented below via the auto-generated OpenAPI spec from the gateway. The spec covers both the data plane (inference endpoints) and the management/control plane (key, team, org, budget, guardrail, prompt, memory, and token-saving plan management).
Base URL: https://api.routero.ai/v1
Authentication: Authorization: Bearer YOUR_ROUTERO_KEY on all requests.
Interactive reference
Key endpoint groups
Inference (data plane) — /v1/...
| Endpoint | Description |
|—|—|
| POST /chat/completions | OpenAI-compatible chat completions (primary endpoint) |
| POST /completions | Legacy text completions |
| POST /embeddings | Text embeddings |
| POST /images/generations | Image generation |
| POST /audio/speech | Text-to-speech |
| POST /audio/transcriptions | Speech-to-text |
| POST /rerank | Reranking (Cohere-compatible) |
| POST /batches | Async batch processing |
| GET /models | List available models |
| POST /v1/messages | Anthropic Messages API compatibility |
Management (control plane) — /...
| Resource | Prefix |
|—|—|
| API Keys | /key/ |
| Organizations | /organization/ |
| Teams | /team/ |
| Users | /user/ |
| Budgets | /budget/ |
| Billing & Wallet | /billing/ |
| Guardrails | /guardrail/ |
| Prompts | /prompts/ |
| Memory Sessions | /memory/session/ |
| Token-Saving Plans | /token-saving/plans/ |
| Models | /model/ |
| Routing / Fallbacks | /fallbacks/ |
| A2A Agents | /v1/agents/ |
| MCP Servers | /mcp/ |
For the complete spec including request/response schemas, see the interactive reference above or download the OpenAPI JSON directly from your instance at /openapi.json.