Routing plane
Classify sensitivity and task, apply policy, select the best eligible model, and dispatch — synchronously on every request.
Architecture
Cardix runs as a single service within your network. Applications connect via an OpenAI-compatible endpoint; Cardix handles classification, policy, selection, and dispatch.
Planes
Routing, learning, and observability work together — without exposing your data or blocking your responses.
Classify sensitivity and task, apply policy, select the best eligible model, and dispatch — synchronously on every request.
Improves routing over time from traffic signals. Runs asynchronously and never blocks the response.
Full audit trail, spend vs baseline comparison, and visibility into blocked demand.
Request journey
Five steps, milliseconds — policy before routing, every time.
Validate the caller and attribute the request to your organization.
Determine data sensitivity and task category for the prompt.
Evaluate residency rules and produce allowed deployment locations. Fail closed if no rule matches.
Choose the best model from your eligible catalog for this request.
Route to the chosen backend and return the response, streaming or buffered.