Model providers

This page covers LLM/model providers (not chat channels like WhatsApp/Telegram). For model selection rules, see /concepts/models.

Quick rules

  • Model refs use provider/model (example: ollama/qwen3-coder:32b).
  • If you set agents.defaults.models, it becomes the allowlist.
  • CLI helpers: datzi onboard, datzi models list, datzi models set <provider/model>.
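When agents.defaults.models is set, only the refs listed there can be selected. A minimal sketch (the 32b entry is a hypothetical second allowed model; the models map shape follows the local-proxy example later on this page):

```
{
  agents: {
    defaults: {
      model: {
        primary: 'ollama/qwen3-coder:14b'
      },
      // Listing models here turns this map into the allowlist:
      models: {
        'ollama/qwen3-coder:14b': {},
        'ollama/qwen3-coder:32b': {}
      }
    }
  }
}
```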

Built-in providers

Datzi runs on Ollama (qwen3-coder:14b). No external API keys required.

Providers via models.providers (custom/base URL)

Ollama

Ollama is a local LLM runtime that provides an OpenAI-compatible API:
  • Provider: ollama
  • Auth: None required (local server)
  • Example model: ollama/qwen3-coder:14b
  • Installation: https://ollama.ai
# Install Ollama, then pull the recommended model:
ollama pull qwen3-coder:14b
Then set it as the default model in your config:
{
  agents: {
    defaults: {
      model: {
        primary: 'ollama/qwen3-coder:14b'
      }
    }
  }
}
Ollama is automatically detected when running locally at http://127.0.0.1:11434/v1. See Local models for model recommendations and custom configuration.
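Auto-detection assumes the default local address. If your Ollama server runs elsewhere (for example, on another machine on your network), you can point the provider at it via models.providers. A sketch, assuming the same provider-block shape as the LM Studio example below; the host address is hypothetical:

```
{
  models: {
    providers: {
      ollama: {
        baseUrl: 'http://192.168.1.50:11434/v1',
        api: 'openai-completions'
      }
    }
  }
}
```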

vLLM

vLLM is a local (or self-hosted) OpenAI-compatible server:
  • Provider: vllm
  • Auth: Optional (depends on your server)
  • Default base URL: http://127.0.0.1:8000/v1
To opt in to auto-discovery locally (any value works if your server doesn’t enforce auth):
export VLLM_API_KEY="vllm-local"
Then set a model (replace with one of the IDs returned by /v1/models):
{
  agents: {
    defaults: {
      model: {
        primary: 'vllm/your-model-id'
      }
    }
  }
}
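To find valid IDs, you can query the server's /v1/models endpoint directly. A minimal standard-library sketch (the base URL is the vLLM default above; it returns an empty list when no server is reachable):

```python
import json
import urllib.error
import urllib.request


def list_model_ids(base_url="http://127.0.0.1:8000/v1"):
    """Return model ids reported by an OpenAI-compatible /v1/models
    endpoint, or an empty list if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/models", timeout=2) as resp:
            data = json.load(resp)
        return [m["id"] for m in data.get("data", [])]
    except (urllib.error.URLError, OSError, ValueError):
        return []


print(list_model_ids())
```

Any ID this prints can be used as vllm/&lt;id&gt; in the config above.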
See Local models for details on local proxy configuration.

Local proxies (LM Studio, vLLM, LiteLLM, etc.)

Example (OpenAI-compatible):
{
  agents: {
    defaults: {
      model: {
        primary: 'lmstudio/minimax-m2.1-gs32'
      },
      models: {
        'lmstudio/minimax-m2.1-gs32': {
          alias: 'Minimax'
        }
      }
    }
  },
  models: {
    providers: {
      lmstudio: {
        baseUrl: 'http://localhost:1234/v1',
        apiKey: 'LMSTUDIO_KEY',
        api: 'openai-completions',
        models: [
          {
            id: 'minimax-m2.1-gs32',
            name: 'MiniMax M2.1',
            reasoning: false,
            input: ['text'],
            cost: {
              input: 0,
              output: 0,
              cacheRead: 0,
              cacheWrite: 0
            },
            contextWindow: 200000,
            maxTokens: 8192
          }
        ]
      }
    }
  }
}
Notes:
  • For custom providers, reasoning, input, cost, contextWindow, and maxTokens are optional. When omitted, Datzi defaults to:
    • reasoning: false
    • input: ["text"]
    • cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 }
    • contextWindow: 200000
    • maxTokens: 8192
  • Recommended: set explicit values that match your proxy/model limits.
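Those defaults behave like a shallow merge beneath your entry: explicit fields win, anything omitted falls back to the values above. An illustrative sketch (not Datzi's actual code; the model id is hypothetical):

```python
# Default fields applied to custom provider model entries
# (values from the notes above).
DEFAULTS = {
    "reasoning": False,
    "input": ["text"],
    "cost": {"input": 0, "output": 0, "cacheRead": 0, "cacheWrite": 0},
    "contextWindow": 200000,
    "maxTokens": 8192,
}


def with_defaults(entry):
    # Explicit fields in the entry win; omitted fields fall back to DEFAULTS.
    return {**DEFAULTS, **entry}


resolved = with_defaults({"id": "my-model", "contextWindow": 131072})
print(resolved["contextWindow"])  # 131072 (explicit value kept)
print(resolved["maxTokens"])      # 8192 (default applied)
```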

CLI examples

datzi onboard
datzi models set ollama/qwen3-coder:32b
datzi models list
See also: /gateway/configuration for full configuration examples.