Model Catalog

Free models

Model	Provider	Context	Best for
`minimax/minimax-m2.5:free`	MiniMax	197K	General chat
`z-ai/glm-4.5-air:free`	Z.AI	131K	Chinese, coding
`nvidia/nemotron-3-super-120b-a12b:free`	NVIDIA	262K	Reasoning
`openai/gpt-oss-120b:free`	OpenAI	131K	General purpose
`google/gemma-4-26b-a4b-it:free`	Google	262K	Lightweight tasks
`qwen/qwen3-coder:free`	Qwen	262K	Coding
`meta-llama/llama-3.3-70b-instruct:free`	Meta	66K	General chat

Paid frontier models

Model	Provider	Input	Output	Context	Recommended
`anthropic/claude-opus-4.7`	Anthropic	$5/M	$25/M	1M	✅
`google/gemini-3-flash-preview`	Google	$0.50/M	$3/M	1M	✅
`openai/gpt-5.5`	OpenAI	$5/M	$30/M	1M	✅
`google/gemini-3.1-pro-preview`	Google	$2/M	$12/M	1M	✅
`deepseek/deepseek-v4-pro`	DeepSeek	$1.74/M	$3.48/M	1M
`openai/gpt-5.5-pro`	OpenAI	$30/M	$180/M	1M
`openai/gpt-5.4`	OpenAI	$2.50/M	$15/M	1M
`deepseek/deepseek-v4-flash`	DeepSeek	$0.14/M	$0.28/M	1M
`openai/gpt-5.4-mini`	OpenAI	$0.75/M	$4.50/M	400K

Pricing per 1M tokens. Subject to change by providers.

Minimum requirements

Mona requires 64,000 tokens of context minimum. All models listed above meet this requirement.

Local models

See the Local LLMs on Mac guide for self-hosted options.

Configuration

Set your default model:

monoclaw config set model.default "anthropic/claude-sonnet-4"

Set a fallback:

monoclaw config set model.fallback "openai/gpt-5.5"

Model aliases

Create shortcuts:

model:
  aliases:
    fast: "openai/gpt-5.4-mini"
    smart: "anthropic/claude-opus-4.7"
    cheap: "deepseek/deepseek-v4-flash"
    chinese: "z-ai/glm-4.5-air"

Use them:

/model fast