AI gateway and model routing
A gateway sits between every consuming application and the underlying models. It enforces authentication, applies tenant- and role-based access policies, routes each request to the model best suited to the workload (frontier, open-weight, domain-fine-tuned, or distilled), falls back to an alternate path when the primary provider is degraded, and handles rate limiting and quota management. Without this layer, every consuming application implements its own version of the same controls, and does so inconsistently.
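
The order of those checks can be made concrete with a minimal sketch. Everything named here (Gateway, GatewayError, ProviderUnavailable, the tier labels, the tenant and key values) is hypothetical and chosen only for illustration. The backends are plain functions standing in for real provider clients, and a simple per-tenant request counter stands in for a real rate limiter.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical type: a backend takes a prompt and returns a completion.
ModelFn = Callable[[str], str]


class ProviderUnavailable(Exception):
    """Raised by a backend that is degraded or unreachable."""


class GatewayError(Exception):
    """Raised when the gateway rejects or cannot serve a request."""


@dataclass
class Gateway:
    api_keys: dict[str, str]            # API key -> tenant
    allowed_tiers: dict[str, set[str]]  # tenant -> model tiers it may call
    quotas: dict[str, int]              # tenant -> remaining requests
    routes: dict[str, list[ModelFn]]    # tier -> backends in priority order

    def handle(self, api_key: str, tier: str, prompt: str) -> str:
        # 1. Authentication: map the credential to a tenant.
        tenant = self.api_keys.get(api_key)
        if tenant is None:
            raise GatewayError("unauthenticated")

        # 2. Access policy: is this tenant allowed to use this tier?
        if tier not in self.allowed_tiers.get(tenant, set()):
            raise GatewayError("forbidden")

        # 3. Quota: charge one request up front (a simplifying assumption;
        #    a real gateway might charge per token or refund on failure).
        if self.quotas.get(tenant, 0) <= 0:
            raise GatewayError("quota exceeded")
        self.quotas[tenant] -= 1

        # 4. Routing with fallback: try backends in priority order and
        #    move to the next one only when a provider is unavailable.
        for backend in self.routes.get(tier, []):
            try:
                return backend(prompt)
            except ProviderUnavailable:
                continue
        raise GatewayError("no backend available")


# Stand-in backends: the primary is degraded, the secondary is healthy.
def frontier_primary(prompt: str) -> str:
    raise ProviderUnavailable("primary degraded")


def frontier_secondary(prompt: str) -> str:
    return f"secondary:{prompt}"


gateway = Gateway(
    api_keys={"key-a": "tenant-a"},
    allowed_tiers={"tenant-a": {"frontier"}},
    quotas={"tenant-a": 2},
    routes={"frontier": [frontier_primary, frontier_secondary]},
)

print(gateway.handle("key-a", "frontier", "hello"))  # prints "secondary:hello"
```

Because the checks live in one place, a consuming application only supplies a credential, a tier, and a prompt. Changing a policy or swapping a degraded provider then requires no change in any application.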