This page covers LLM/model providers (not chat channels like WhatsApp/Telegram). For model selection rules, see /concepts/models.
Model refs take the form `provider/model` (example: `opencode/claude-opus-4-6`). If you set `agents.defaults.models`, it becomes the allowlist. The main CLI entry points are `coderclaw onboard`, `coderclaw models list`, and `coderclaw models set <provider/model>`.

API keys are resolved from the environment in this order:

1. `CODERCLAW_LIVE_<PROVIDER>_KEY` (single live override, highest priority)
2. `<PROVIDER>_API_KEYS` (comma or semicolon list)
3. `<PROVIDER>_API_KEY` (primary key)
4. `<PROVIDER>_API_KEY_*` (numbered list, e.g. `<PROVIDER>_API_KEY_1`)

For Google, `GOOGLE_API_KEY` is also included as a fallback. When several keys are available, CoderClaw rotates to the next one on rate-limit errors (`429`, `rate_limit`, `quota`, `resource exhausted`).

CoderClaw ships with the pi-ai catalog. These providers require no `models.providers` config; just set auth and pick a model.
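For example, to rotate across two OpenAI keys and keep a temporary live override on hand (key values here are placeholders):

```sh
# Comma- or semicolon-separated list; CoderClaw advances on 429/rate_limit/quota errors.
export OPENAI_API_KEYS="sk-key-one;sk-key-two"
# Single live override; takes priority over every other key source.
export CODERCLAW_LIVE_OPENAI_KEY="sk-live-key"
```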
### OpenAI

- Provider: `openai`
- Auth: `OPENAI_API_KEY`
- Key rotation: `OPENAI_API_KEYS`, `OPENAI_API_KEY_1`, `OPENAI_API_KEY_2`, plus `CODERCLAW_LIVE_OPENAI_KEY` (single override)
- Example model: `openai/gpt-5.1-codex`
- CLI: `coderclaw onboard --auth-choice openai-api-key`

```json5
{
  agents: { defaults: { model: { primary: "openai/gpt-5.1-codex" } } },
}
```
### Anthropic

- Provider: `anthropic`
- Auth: `ANTHROPIC_API_KEY` or `claude setup-token`
- Key rotation: `ANTHROPIC_API_KEYS`, `ANTHROPIC_API_KEY_1`, `ANTHROPIC_API_KEY_2`, plus `CODERCLAW_LIVE_ANTHROPIC_KEY` (single override)
- Example model: `anthropic/claude-opus-4-6`
- CLI: `coderclaw onboard --auth-choice token` (paste setup-token) or `coderclaw models auth paste-token --provider anthropic`

```json5
{
  agents: { defaults: { model: { primary: "anthropic/claude-opus-4-6" } } },
}
```
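If you take the setup-token route, the flow is roughly as follows (assuming the Claude CLI is installed; the token it prints is what you paste):

```sh
claude setup-token
coderclaw models auth paste-token --provider anthropic
```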
### OpenAI Codex

- Provider: `openai-codex`
- Example model: `openai-codex/gpt-5.3-codex`
- CLI: `coderclaw onboard --auth-choice openai-codex` or `coderclaw models auth login --provider openai-codex`

```json5
{
  agents: { defaults: { model: { primary: "openai-codex/gpt-5.3-codex" } } },
}
```
### OpenCode Zen

- Provider: `opencode`
- Auth: `OPENCODE_API_KEY` (or `OPENCODE_ZEN_API_KEY`)
- Example model: `opencode/claude-opus-4-6`
- CLI: `coderclaw onboard --auth-choice opencode-zen`

```json5
{
  agents: { defaults: { model: { primary: "opencode/claude-opus-4-6" } } },
}
```
### Google

- Provider: `google`
- Auth: `GEMINI_API_KEY`
- Key rotation: `GEMINI_API_KEYS`, `GEMINI_API_KEY_1`, `GEMINI_API_KEY_2`, `GOOGLE_API_KEY` fallback, and `CODERCLAW_LIVE_GEMINI_KEY` (single override)
- Example model: `google/gemini-3-pro-preview`
- CLI: `coderclaw onboard --auth-choice gemini-api-key`
- Related providers: `google-vertex`, `google-antigravity`, `google-gemini-cli`

Antigravity OAuth is handled by a bundled plugin (`google-antigravity-auth`, disabled by default):

```sh
coderclaw plugins enable google-antigravity-auth
coderclaw models auth login --provider google-antigravity --set-default
```

Gemini CLI OAuth is likewise handled by a bundled plugin (`google-gemini-cli-auth`, disabled by default):

```sh
coderclaw plugins enable google-gemini-cli-auth
coderclaw models auth login --provider google-gemini-cli --set-default
```

OAuth tokens are not kept in `coderclaw.json`. The CLI login flow stores tokens in auth profiles on the gateway host.

### Z.AI

- Provider: `zai`
- Auth: `ZAI_API_KEY`
- Example model: `zai/glm-4.7`
- CLI: `coderclaw onboard --auth-choice zai-api-key`

Model refs under `z.ai/*` and `z-ai/*` normalize to `zai/*` (so `z.ai/glm-4.7` resolves to `zai/glm-4.7`).

### Vercel AI Gateway

- Provider: `vercel-ai-gateway`
- Auth: `AI_GATEWAY_API_KEY`
- Example model: `vercel-ai-gateway/anthropic/claude-opus-4.6`
- CLI: `coderclaw onboard --auth-choice ai-gateway-api-key`

### Additional catalog providers

- `openrouter` (`OPENROUTER_API_KEY`): example model `openrouter/anthropic/claude-sonnet-4-5`
- `xai` (`XAI_API_KEY`)
- `groq` (`GROQ_API_KEY`)
- `cerebras` (`CEREBRAS_API_KEY`): also serves GLM models as `zai-glm-4.7` and `zai-glm-4.6`; base URL `https://api.cerebras.ai/v1`
- `mistral` (`MISTRAL_API_KEY`)
- `github-copilot` (`COPILOT_GITHUB_TOKEN` / `GH_TOKEN` / `GITHUB_TOKEN`)
- `huggingface` (`HUGGINGFACE_HUB_TOKEN` or `HF_TOKEN`): OpenAI-compatible router; example model `huggingface/deepseek-ai/DeepSeek-R1`; CLI `coderclaw onboard --auth-choice huggingface-api-key`. See Hugging Face (Inference).

## models.providers (custom/base URL)

Use `models.providers` (or `models.json`) to add custom providers or OpenAI/Anthropic-compatible proxies.
Moonshot uses OpenAI-compatible endpoints, so configure it as a custom provider:
- Provider: `moonshot`
- Auth: `MOONSHOT_API_KEY`
- Example model: `moonshot/kimi-k2.5`

Kimi K2 model IDs:

- `moonshot/kimi-k2.5`
- `moonshot/kimi-k2-0905-preview`
- `moonshot/kimi-k2-turbo-preview`
- `moonshot/kimi-k2-thinking`
- `moonshot/kimi-k2-thinking-turbo`

```json5
{
  agents: {
    defaults: { model: { primary: "moonshot/kimi-k2.5" } },
  },
  models: {
    mode: "merge",
    providers: {
      moonshot: {
        baseUrl: "https://api.moonshot.ai/v1",
        apiKey: "${MOONSHOT_API_KEY}",
        api: "openai-completions",
        models: [{ id: "kimi-k2.5", name: "Kimi K2.5" }],
      },
    },
  },
}
```
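Once the provider is merged into the catalog, you can switch to it with the same CLI used elsewhere on this page:

```sh
coderclaw models set moonshot/kimi-k2.5
```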
Kimi Coding uses Moonshot AI’s Anthropic-compatible endpoint:
- Provider: `kimi-coding`
- Auth: `KIMI_API_KEY`
- Example model: `kimi-coding/k2p5`

```json5
{
  env: { KIMI_API_KEY: "sk-..." },
  agents: {
    defaults: { model: { primary: "kimi-coding/k2p5" } },
  },
}
```
Qwen provides OAuth access to Qwen Coder + Vision via a device-code flow. Enable the bundled plugin, then log in:
```sh
coderclaw plugins enable qwen-portal-auth
coderclaw models auth login --provider qwen-portal --set-default
```
Model refs:
- `qwen-portal/coder-model`
- `qwen-portal/vision-model`

See /providers/qwen for setup details and notes.
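If you prefer pinning the model in config rather than via `--set-default`, the usual `agents.defaults` shape applies (a sketch):

```json5
{
  agents: { defaults: { model: { primary: "qwen-portal/coder-model" } } },
}
```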
Synthetic provides Anthropic-compatible models behind the synthetic provider:
- Provider: `synthetic`
- Auth: `SYNTHETIC_API_KEY`
- Example model: `synthetic/hf:MiniMaxAI/MiniMax-M2.1`
- CLI: `coderclaw onboard --auth-choice synthetic-api-key`

```json5
{
  agents: {
    defaults: { model: { primary: "synthetic/hf:MiniMaxAI/MiniMax-M2.1" } },
  },
  models: {
    mode: "merge",
    providers: {
      synthetic: {
        baseUrl: "https://api.synthetic.new/anthropic",
        apiKey: "${SYNTHETIC_API_KEY}",
        api: "anthropic-messages",
        models: [{ id: "hf:MiniMaxAI/MiniMax-M2.1", name: "MiniMax M2.1" }],
      },
    },
  },
}
```
MiniMax is configured via models.providers because it uses custom endpoints:
- Auth: `MINIMAX_API_KEY`
- CLI: `coderclaw onboard --auth-choice minimax-api`

See /providers/minimax for setup details, model options, and config snippets.
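As a rough sketch of the shape to expect, mirroring the Anthropic-compatible Synthetic example above (the base URL and model ID here are assumptions; treat /providers/minimax as authoritative):

```json5
{
  agents: {
    defaults: { model: { primary: "minimax/MiniMax-M2.1" } },
  },
  models: {
    mode: "merge",
    providers: {
      minimax: {
        baseUrl: "https://api.minimax.io/anthropic", // assumed endpoint; verify in /providers/minimax
        apiKey: "${MINIMAX_API_KEY}",
        api: "anthropic-messages",
        models: [{ id: "MiniMax-M2.1", name: "MiniMax M2.1" }], // hypothetical model ID
      },
    },
  },
}
```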
Ollama is a local LLM runtime that provides an OpenAI-compatible API:
- Provider: `ollama`
- Example model: `ollama/llama3.3`

```sh
# Install Ollama, then pull a model:
ollama pull llama3.3
```

```json5
{
  agents: {
    defaults: { model: { primary: "ollama/llama3.3" } },
  },
}
```
Ollama is automatically detected when running locally at http://127.0.0.1:11434/v1. See /providers/ollama for model recommendations and custom configuration.
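If your Ollama instance runs on another machine, you can point CoderClaw at it with the same `models.providers` merge pattern used above (the host address is a placeholder; see /providers/ollama for the supported fields):

```json5
{
  models: {
    mode: "merge",
    providers: {
      ollama: {
        baseUrl: "http://192.168.1.50:11434/v1", // placeholder remote host
        apiKey: "ollama", // placeholder; Ollama does not enforce auth by default
        api: "openai-completions",
        models: [{ id: "llama3.3", name: "Llama 3.3" }],
      },
    },
  },
}
```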
vLLM is a local (or self-hosted) OpenAI-compatible server:
- Provider: `vllm`
- Default base URL: `http://127.0.0.1:8000/v1`

To opt in to auto-discovery locally (any value works if your server doesn't enforce auth):

```sh
export VLLM_API_KEY="vllm-local"
```
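To check which IDs your server exposes, query the standard OpenAI-compatible models endpoint:

```sh
curl http://127.0.0.1:8000/v1/models
```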
Then set a model (replace with one of the IDs returned by /v1/models):
```json5
{
  agents: {
    defaults: { model: { primary: "vllm/your-model-id" } },
  },
}
```
See /providers/vllm for details.
LM Studio example (OpenAI-compatible, via a custom `lmstudio` provider):
```json5
{
  agents: {
    defaults: {
      model: { primary: "lmstudio/minimax-m2.1-gs32" },
      models: { "lmstudio/minimax-m2.1-gs32": { alias: "Minimax" } },
    },
  },
  models: {
    providers: {
      lmstudio: {
        baseUrl: "http://localhost:1234/v1",
        apiKey: "LMSTUDIO_KEY",
        api: "openai-completions",
        models: [
          {
            id: "minimax-m2.1-gs32",
            name: "MiniMax M2.1",
            reasoning: false,
            input: ["text"],
            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
            contextWindow: 200000,
            maxTokens: 8192,
          },
        ],
      },
    },
  },
}
```
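For contrast, since most per-model fields are optional (see the notes just below), a minimal sketch of the same provider could be:

```json5
{
  models: {
    providers: {
      lmstudio: {
        baseUrl: "http://localhost:1234/v1",
        apiKey: "LMSTUDIO_KEY",
        api: "openai-completions",
        // reasoning, input, cost, contextWindow, maxTokens fall back to catalog defaults
        models: [{ id: "minimax-m2.1-gs32", name: "MiniMax M2.1" }],
      },
    },
  },
}
```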
Notes:
`reasoning`, `input`, `cost`, `contextWindow`, and `maxTokens` are optional. When omitted, CoderClaw defaults to:

- `reasoning: false`
- `input: ["text"]`
- `cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 }`
- `contextWindow: 200000`
- `maxTokens: 8192`

A quick end-to-end flow with OpenCode Zen:

```sh
coderclaw onboard --auth-choice opencode-zen
coderclaw models set opencode/claude-opus-4-6
coderclaw models list
```
See also: /gateway/configuration for full configuration examples.