Overview
The AI Gateway sits between your applications and upstream model providers. It exposes a single OpenAI compatible API and handles routing, authentication, observability, and resiliency in one place.
For a deeper technical view, start with Architecture and Request lifecycle.
When to use it
- You need multiple providers with a consistent interface.
- You want failover and load balancing without rewriting app code.
- You want centralized logs, metrics, and request tracing.
How it works
- Your app sends OpenAI format requests to the gateway.
- The gateway applies routing, auth, and policy checks.
- The gateway forwards traffic to the selected provider.
- Responses and metrics are recorded and returned to your app.
Learn by depth
- Architecture: Understand components and system boundaries.
- Request lifecycle: Follow each processing stage and fallback path.
- Guides: Configure provider auth, routing policies, and controls.