API Reference

The full Routero AI API is documented below via the auto-generated OpenAPI spec from the gateway. The spec covers both the data plane (inference endpoints) and the management/control plane (key, team, org, budget, guardrail, prompt, memory, and token-saving plan management).

Base URL: https://api.routero.ai/v1 Authentication: Authorization: Bearer YOUR_ROUTERO_KEY on all requests.


Interactive reference


Key endpoint groups

Inference (data plane) — /v1/...

| Endpoint | Description | |—|—| | POST /chat/completions | OpenAI-compatible chat completions (primary endpoint) | | POST /completions | Legacy text completions | | POST /embeddings | Text embeddings | | POST /images/generations | Image generation | | POST /audio/speech | Text-to-speech | | POST /audio/transcriptions | Speech-to-text | | POST /rerank | Reranking (Cohere-compatible) | | POST /batches | Async batch processing | | GET /models | List available models | | POST /v1/messages | Anthropic Messages API compatibility |

Management (control plane) — /...

| Resource | Prefix | |—|—| | API Keys | /key/ | | Organizations | /organization/ | | Teams | /team/ | | Users | /user/ | | Budgets | /budget/ | | Billing & Wallet | /billing/ | | Guardrails | /guardrail/ | | Prompts | /prompts/ | | Memory Sessions | /memory/session/ | | Token-Saving Plans | /token-saving/plans/ | | Models | /model/ | | Routing / Fallbacks | /fallbacks/ | | A2A Agents | /v1/agents/ | | MCP Servers | /mcp/ |

For the complete spec including request/response schemas, see the interactive reference above or download the OpenAPI JSON directly from your instance at /openapi.json.