Request lifecycle

Ingress: Gateway receives an OpenAI-compatible request.
Validation: Authentication and request shape checks run.
Policy checks: Rate limits, plugins, and controls are applied.
Routing decision: Strategy picks primary provider/model.
Execution: Request is sent upstream.
Recovery: Retry/fallback runs on eligible failures.
Egress: Response and metadata are returned to client.
Telemetry: Logs and metrics are emitted.

This page explains the full lifecycle from client request to provider response, including retries and fallback.

Lifecycle stages