@earendil-works/pi-ai catalog — currently 27 hosted providers (Anthropic, OpenAI, Google, Mistral, Groq, OpenRouter, DeepSeek, xAI, Cerebras, Amazon Bedrock, Azure OpenAI Responses, Vercel AI Gateway, Cloudflare AI Gateway / Workers AI, GitHub Copilot, Hugging Face, Fireworks, Together (via OpenRouter), Moonshot, Minimax, Z.ai, Kimi, OpenCode, and more). The catalog is queried at runtime — adding a new provider to pi-ai automatically lights it up across the daemon, web wizard, CLI wizard, and comis models list with zero comis code changes.
Plus you can register custom OpenAI-compatible endpoints (any HTTP service that speaks the OpenAI chat-completions API) for proxies and self-hosted gateways.
You can set a default model for all agents, override it per agent, and configure automatic failover so your agents keep working even if one provider goes down.
Discovering available providers and models
The catalog is dynamic. Two CLI commands surface it:models.list_providers / models.list RPC first, and falls back to the local pi-ai static catalog when the daemon isn’t running (so the wizard still works pre-init).
In the web dashboard, the same catalog drives:
- Setup Wizard › Provider step — provider grid populated from
models.list_providers - Setup Wizard › Model dropdown — per-provider model list from
models.list - Models view › Catalog tab — full searchable model catalog
- Agents view › Edit agent — provider and model dropdowns
getProviders() and getModels(provider) from @earendil-works/pi-ai.
Custom providers (proxies and self-hosted)
For any endpoint not in the pi-ai catalog — your own gateway, a private deployment, an OpenAI-compatible proxy — register a custom provider entry underproviders.entries.<id> with type: "openai" and your custom baseUrl. The daemon’s built-in-provider guard rejects redundant entries that try to re-create a built-in provider with the catalog’s default URL (those should use the built-in path: env_set + agent provider/model directly).
Choosing a model
You set a default AI model globally, and every agent uses that model unless you override it. The default is Claude Sonnet from Anthropic, but you can switch to any supported provider at any time. You can also create model aliases — friendly shorthand names that map to specific provider and model combinations. Instead of typing a long model ID every time, you can just use a name likeclaude or gpt4.
Three levels of configuration
Comis gives you three levels of control over which model your agents use. Each level overrides the one above it:Level 1: Global defaults
Set the default model and provider for all agents. This is the simplest way to get started — configure it once and every agent uses it.| Option | Type | Default | What it does |
|---|---|---|---|
models.defaultModel | string | "" (falls back to claude-sonnet-4-5-20250929) | Default model for all agents |
models.defaultProvider | string | "" (falls back to anthropic) | Default provider for all agents |
models.aliases | array | [] | Friendly names for model/provider pairs |
models.scanOnStartup | boolean | false | Scan providers for available models on startup |
~/.comis/config.yaml
claude would
use anthropic/claude-sonnet-4-5-20250929.
Level 2: Per-agent override
Override the global default for specific agents. This is useful when different agents have different needs — your main conversational agent might use Claude, while a code-focused agent uses GPT-4o, a fast triage agent uses Groq, and a local-only agent uses Ollama.| Option | Type | Default | What it does |
|---|---|---|---|
agents.*.model | string | "default" | Model for this agent (or "default" to use global) |
agents.*.provider | string | "default" | Provider for this agent (or "default" to use global) |
~/.comis/config.yaml
Level 3: Per-task routes
For advanced use cases, you can route specific tasks to different models. For example, use a cheaper model for summarization and your primary model for everything else.~/.comis/config.yaml
Automatic failover
What happens when your AI provider goes down or returns errors? Comis can automatically try backup models so your agents keep working.Fallback models
You can specify an ordered list of backup models. If the primary model fails, Comis tries the next one in the list, and so on until one succeeds or all options are exhausted.Auth profiles
You can configure multiple API keys per provider. Comis rotates between them automatically, which is useful for:- Rate limit management — If one key hits its rate limit, Comis switches to another.
- Redundancy — If one key is revoked or expired, others keep working.
Cooldown behavior
After a model fails, Comis waits before trying it again. The wait time increases with each failure using exponential backoff:- First failure: 1 minute cooldown
- Second failure: 5 minutes
- Third failure: 25 minutes
- Maximum: 1 hour cap
Failover configuration
| Option | Type | Default | What it does |
|---|---|---|---|
modelFailover.fallbackModels | array | [] | Ordered list of backup models to try |
modelFailover.authProfiles | array | [] | API key profiles for rotation |
modelFailover.allowedModels | array | [] | Model allowlist (empty = allow all) |
modelFailover.maxAttempts | number | 6 | Maximum total attempts across all models and keys |
modelFailover.cooldownInitialMs | number | 60000 | Initial cooldown after failure (1 minute) |
modelFailover.cooldownMultiplier | number | 5 | Cooldown multiplier for exponential backoff |
modelFailover.cooldownCapMs | number | 3600000 | Maximum cooldown duration (1 hour) |
~/.comis/config.yaml
Model capability matrix
Different model families support different features. The capability matrix governs how Comis builds the request body for each provider.| Family | Context window (typical) | Prompt cache | Tool use | Vision | Reasoning blocks |
|---|---|---|---|---|---|
| Anthropic Claude (Sonnet/Opus/Haiku 4.x) | 200K | Ephemeral 5m / 1h | Yes | Yes | Yes |
| OpenAI GPT-4o family | 128K | Implicit (provider-managed) | Yes | Yes | No |
| OpenAI GPT-5.4 family | 200K+ | Implicit | Yes | Yes | Optional |
| Google Gemini 2.5 / 3 | 1M | Explicit CachedContent | Yes | Yes | No |
| Mistral Large / Codestral | 128K | None | Yes (strict ID mode) | Pixtral only | No |
| Groq (Llama / Mixtral) | 128K | None | Yes | No | No |
| Ollama (local) | Varies by model | None | If model supports | If model supports | If model supports |
| OpenRouter | Per upstream | Per upstream | Per upstream | Per upstream | Per upstream |
| Cerebras (Llama) | 128K | None | Yes | No | No |
| xAI Grok | 128K | None | Yes | Limited | No |
| DeepSeek | 64K-128K | None | Yes | No | No |
getModels(provider)). To see the live catalog: comis models list (or filter to one provider with comis models list --provider <id>). The capability matrix above is a reference summary — concrete
per-model capabilities (cost, max tokens, vision support, reasoning) are
queryable directly via comis models list --format json or the
models.list RPC.
Key rotation
You can configure multiple API keys per provider to spread usage across accounts or handle key expiration gracefully. When a key hits a rate limit or returns an authentication error, Comis automatically switches to the next available key. Keys are managed through the Secret Manager. EachauthProfile entry maps a secret name to a provider, and Comis handles the
rotation logic automatically.
Safety
Budget protection, circuit breakers, and cost controls.
Routing
How messages are directed to specific agents.
