Skip to content

Model Routing & Providers

RedCloud is provider-agnostic for AI. You configure one or more providers, and a router directs each request to the right model with failover and a spend budget — so AI features stay available and cost-controlled.

ProviderNotes
Anthropic (Claude)Native support
OpenAIGPT models
Google GeminiGemini models
Google Vertex AIGemini via Vertex, Workload Identity supported
xAI (Grok)
Mistral
DeepSeek
OllamaLocal models (no data leaves your environment)

A range of models across these providers is available for selection.

The model router directs each request based on configuration, tracks provider health, and fails over to the next provider in the configured order if one is unavailable — so a single provider outage doesn’t break AI features.

A configurable USD budget (overall and per-tenant) bounds AI spend, and cost tracking records usage so you can see and trend AI cost. This keeps AI predictable and prevents runaway expense.

Providers, the default model, fallback order, and budgets are set under Administration → System Settings → AI Configuration and Model Routing. Local models (Ollama) keep inference entirely within your environment when data residency matters.

  • AI features may be gated by your license plan.
  • For strict data-residency needs, prefer a local provider (Ollama) or Vertex with Workload Identity.