Model Catalog
Free models
| Model | Provider | Context | Best for |
|---|---|---|---|
minimax/minimax-m2.5:free | MiniMax | 197K | General chat |
z-ai/glm-4.5-air:free | Z.AI | 131K | Chinese, coding |
nvidia/nemotron-3-super-120b-a12b:free | NVIDIA | 262K | Reasoning |
openai/gpt-oss-120b:free | OpenAI | 131K | General purpose |
google/gemma-4-26b-a4b-it:free | 262K | Lightweight tasks | |
qwen/qwen3-coder:free | Qwen | 262K | Coding |
meta-llama/llama-3.3-70b-instruct:free | Meta | 66K | General chat |
Paid frontier models
| Model | Provider | Input | Output | Context | Recommended |
|---|---|---|---|---|---|
anthropic/claude-opus-4.7 | Anthropic | $5/M | $25/M | 1M | ✅ |
google/gemini-3-flash-preview | $0.50/M | $3/M | 1M | ✅ | |
openai/gpt-5.5 | OpenAI | $5/M | $30/M | 1M | ✅ |
google/gemini-3.1-pro-preview | $2/M | $12/M | 1M | ✅ | |
deepseek/deepseek-v4-pro | DeepSeek | $1.74/M | $3.48/M | 1M | |
openai/gpt-5.5-pro | OpenAI | $30/M | $180/M | 1M | |
openai/gpt-5.4 | OpenAI | $2.50/M | $15/M | 1M | |
deepseek/deepseek-v4-flash | DeepSeek | $0.14/M | $0.28/M | 1M | |
openai/gpt-5.4-mini | OpenAI | $0.75/M | $4.50/M | 400K |
Pricing per 1M tokens. Subject to change by providers.
Minimum requirements
Mona requires 64,000 tokens of context minimum. All models listed above meet this requirement.
Local models
See the Local LLMs on Mac guide for self-hosted options.
Configuration
Set your default model:
monoclaw config set model.default "anthropic/claude-sonnet-4"
Set a fallback:
monoclaw config set model.fallback "openai/gpt-5.5"
Model aliases
Create shortcuts:
model:
aliases:
fast: "openai/gpt-5.4-mini"
smart: "anthropic/claude-opus-4.7"
cheap: "deepseek/deepseek-v4-flash"
chinese: "z-ai/glm-4.5-air"
Use them:
/model fast