LLM Models

Name: OATDA - AI Models Catalog
Availability: InStock

Vision Capable

Image Generation

Video Generation

Audio Generation

Alibaba


deepseek-v4-flash 🇪🇺 EU	0.14 $	0.28 $	0.028 $	1.0M
deepseek-v4-pro 🇪🇺 EU	1.65 $	3.3 $	0.137522 $	1.0M
glm-5.2 🇪🇺 EU	1.1 $	3.851 $	0.22 $	1.0M
happyhorse-1.1-i2vVideo Generation 🇪🇺 EU	0.123769 $	0.165026 $	-	-
happyhorse-1.1-t2vVideo Generation 🇪🇺 EU	0.123769 $	0.165026 $	-	-
kimi-k2.7-codeVision Capable 🇪🇺 EU	0.894 $	3.713 $	0.179 $	262K
qwen-flash 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.022 $↑ 0.216 $ 128K - 256K ↓ 0.087 $↑ 0.861 $ 256K - 1.0M ↓ 0.173 $↑ 1.721 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-flash-2025-07-28 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.022 $↑ 0.216 $ 128K - 256K ↓ 0.087 $↑ 0.861 $ 256K - 1.0M ↓ 0.173 $↑ 1.721 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-imageImage Generation	0.035 $	0.035 $	0 $	-
qwen-image-2.0Image Generation 🇪🇺 EU	0.028671 $	0.028671 $	-	-
qwen-image-2.0-proImage Generation 🇪🇺 EU	0.071676 $	0.071676 $	-	-
qwen-image-editImage Generation 🇪🇺 EU	0.043 $	0.043 $	-	-
qwen-max	1.6 $	6.4 $	-	33K
qwen-max-2025-01-25	1.6 $	6.4 $	-	33K
qwen-max-latest	1.6 $	6.4 $	-	33K
qwen-plus 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.4 $↑ 1.2 $ 256K - 1.0M ↓ 1.2 $↑ 3.6 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-plus-2025-07-28 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.115 $↑ 0.287 $ 128K - 256K ↓ 0.345 $↑ 2.868 $ 256K - 1.0M ↓ 0.689 $↑ 6.881 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-plus-2025-09-11 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.115 $↑ 0.287 $ 128K - 256K ↓ 0.345 $↑ 2.868 $ 256K - 1.0M ↓ 0.689 $↑ 6.881 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-plus-2025-12-01 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.4 $↑ 1.2 $ 256K - 1.0M ↓ 1.2 $↑ 3.6 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-plus-latest 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.4 $↑ 1.2 $ 256K - 1.0M ↓ 1.2 $↑ 3.6 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen-turbo	0.05 $	0.2 $	-	1.0M
qwen-turbo-thinking	0.05 $	0.5 $	-	131K
qwen3-coder-flash 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 32K ↓ 0.144 $↑ 0.574 $ 32K - 128K ↓ 0.216 $↑ 0.861 $ 128K - 256K ↓ 0.359 $↑ 1.434 $ 256K - 1.0M ↓ 0.717 $↑ 3.584 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3-coder-flash-2025-07-28 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 32K ↓ 0.144 $↑ 0.574 $ 32K - 128K ↓ 0.216 $↑ 0.861 $ 128K - 256K ↓ 0.359 $↑ 1.434 $ 256K - 1.0M ↓ 0.717 $↑ 3.584 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3-coder-plus 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 32K ↓ 0.574 $↑ 2.294 $ 32K - 128K ↓ 0.861 $↑ 3.441 $ 128K - 256K ↓ 1.434 $↑ 5.735 $ 256K - 1.0M ↓ 2.868 $↑ 28.671 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3-max 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 32K ↓ 1.2 $↑ 6 $ 32K - 128K ↓ 2.4 $↑ 12 $ 128K - 252K ↓ 3 $↑ 15 $ Cache: Hit 20% / Explicit 10% / Creation 125%			262K
qwen3-max-preview 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 32K ↓ 0.861 $↑ 3.441 $ 32K - 128K ↓ 1.434 $↑ 5.735 $ 128K - 256K ↓ 2.151 $↑ 8.602 $ Cache: Hit 20% / Explicit 10% / Creation 125%			262K
qwen3.5-397b-a17bVision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.172 $↑ 1.032 $ 128K - 256K ↓ 0.43 $↑ 2.58 $ Cache: Hit 20% / Explicit 10% / Creation 125%			256K
qwen3.5-flash 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 1.0M ↓ 0.1 $↑ 0.4 $ > 1.0M ↓ 0.1 $↑ 0.4 $			1.0M
qwen3.5-flash-2026-02-23 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 1.0M ↓ 0.1 $↑ 0.4 $ > 1.0M ↓ 0.1 $↑ 0.4 $			1.0M
qwen3.5-plusVision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.115 $↑ 0.688 $ 128K - 256K ↓ 0.287 $↑ 1.72 $ 256K - 1.0M ↓ 0.573 $↑ 3.44 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3.5-plus-2026-02-15Vision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 0.115 $↑ 0.688 $ 128K - 256K ↓ 0.287 $↑ 1.72 $ 256K - 1.0M ↓ 0.573 $↑ 3.44 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3.6-flash 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.165 $↑ 0.99 $ > 256K ↓ 0.66 $↑ 3.961 $			1.0M
qwen3.6-flash-2026-04-16 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.165 $↑ 0.99 $ > 256K ↓ 0.66 $↑ 3.961 $			1.0M
qwen3.6-max-preview 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 128K ↓ 1.3 $↑ 7.8 $ 128K - 256K ↓ 2 $↑ 12 $ Cache: Hit 20% / Explicit 10% / Creation 125%			256K
qwen3.6-plusVision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.276 $↑ 1.651 $ 256K - 1.0M ↓ 1.101 $↑ 6.602 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3.6-plus-2026-04-02Vision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.276 $↑ 1.651 $ 256K - 1.0M ↓ 1.101 $↑ 6.602 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3.7-max 🇪🇺 EU	1.65 $	4.951 $	0.165 $	1.0M
qwen3.7-max-2026-05-17 🇪🇺 EU	1.65 $	4.951 $	0.165 $	1.0M
qwen3.7-max-2026-05-20 🇪🇺 EU	1.65 $	4.951 $	0.165 $	1.0M
qwen3.7-max-2026-06-08 🇪🇺 EU	1.65 $	4.951 $	0.165 $	1.0M
qwen3.7-max-preview 🇪🇺 EU	1.65 $	4.951 $	0.165 $	1.0M
qwen3.7-plusVision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.276 $↑ 1.101 $ 256K - 1.0M ↓ 0.826 $↑ 3.301 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
qwen3.7-plus-2026-05-26Vision Capable 🇪🇺 EU	ContextIn/Out per 1M tok ≤ 256K ↓ 0.276 $↑ 1.101 $ 256K - 1.0M ↓ 0.826 $↑ 3.301 $ Cache: Hit 20% / Explicit 10% / Creation 125%			1.0M
wan2.5-t2v-previewVideo Generation	0.15 $	0.15 $	0 $	-
wan2.6-i2vVideo Generation 🇪🇺 EU	0.086012 $	0.143353 $	-	-
wan2.6-i2v-flashVideo Generation 🇪🇺 EU	0.043006 $	0.071676 $	-	-
wan2.6-imageImage Generation 🇪🇺 EU	0.028671 $	0.028671 $	-	-
wan2.6-r2vVideo Generation 🇪🇺 EU	0.086012 $	0.143353 $	-	-
wan2.6-r2v-flashVideo Generation 🇪🇺 EU	0.043006 $	0.071676 $	-	-
wan2.6-t2iImage Generation 🇪🇺 EU	0.028671 $	0.028671 $	-	-
wan2.6-t2vVideo Generation 🇪🇺 EU	0.086012 $	0.143353 $	-	-
wan2.7-imageImage Generation 🇪🇺 EU	0.028671 $	0.028671 $	-	-
wan2.7-image-proImage Generation 🇪🇺 EU	0.068761 $	0.068761 $	-	-

Anthropic


claude-haiku-4-5-20251001Vision Capable	1 $	5 $	0.1 $	200K
claude-opus-4-1-20250805Vision Capable	15 $	75 $	1.5 $	200K
claude-opus-4-5-20251101Vision Capable	5 $	25 $	0.5 $	200K
claude-opus-4-6Vision Capable	5 $	25 $	0.5 $	1.0M
claude-opus-4-7Vision Capable	5 $	25 $	0.5 $	1.0M
claude-opus-4-8Vision Capable	5 $	25 $	0.5 $	1.0M
claude-sonnet-4-5-20250929Vision Capable	3 $	15 $	0.3 $	200K
claude-sonnet-4-6Vision Capable	3 $	15 $	0.3 $	1.0M
claude-sonnet-5Vision Capable	2 $	10 $	0.2 $	1.0M

Microsoft Azure (EU)


kimi-k2.5Vision Capable	0.6 $	3 $	0.1 $	200K

Bytedance


seed-1-6-250915Vision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.25 $↑ 2 $ > 128K ↓ 0.5 $↑ 4 $ Cache: 0.05 $ per million			256K
seed-1-8-251228Vision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.25 $↑ 2 $ > 128K ↓ 0.5 $↑ 4 $ Cache: 0.05 $ per million			256K
seedance-1-0-pro-250528Video Generation	2.5 $	2.5 $	0 $	-
seedance-1-0-pro-fast-251015Video Generation	1 $	1 $	0 $	-
seedance-1-5-pro-251215Video Generation	2.4 $	2.4 $	0 $	-
seedream-4-0-250828Image Generation	0.03 $	0.03 $	-	-
seedream-4-5-251128Image Generation	0.04 $	0.04 $	-	-
seedream-5-0-260128Image Generation	0.035 $	0.035 $	-	-
seedream-5-0-lite-260128Image Generation	0.035 $	0.035 $	-	-
seedream-5-0-pro-260628Image Generation	0.045 $	0.045 $	-	-

Deepinfra


deepseek-ai/DeepSeek-R1-0528	0.5 $	2.15 $	0.35 $	128K
google/gemma-4-26B-A4B-itVision Capable	0.08 $	0.35 $	0.01 $	262K
google/gemma-4-31B-itVision Capable	0.13 $	0.38 $	0.02 $	262K
moonshotai/Kimi-K2.5Vision Capable	0.45 $	2.25 $	0.07 $	262K

Deepseek


deepseek-v4-flash	0.14 $	0.28 $	0.0028 $	1.0M
deepseek-v4-pro	0.435 $	0.87 $	0.003625 $	1.0M

Google


gemini-2.5-flashVision Capable	0.3 $	2.5 $	0.03 $	1.0M
gemini-2.5-flash-imageImage Generation	0.3 $	2.5 $	0.03 $	-
gemini-2.5-flash-liteVision Capable	0.1 $	0.4 $	0.01 $	1.0M
gemini-2.5-flash-preview-ttsAudio Generation	0.5 $	10 $	-	8K
gemini-2.5-proVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1.25 $↑ 10 $ > 200K ↓ 2.5 $↑ 15 $ Cache (≤ 200K):Read 0.125 $ Cache (> 200K):Read 0.25 $			1.0M
gemini-2.5-pro-preview-ttsAudio Generation	1 $	20 $	-	8K
gemini-3-flash-previewVision Capable	0.5 $	3 $	0.05 $	1.0M
gemini-3-pro-image-previewImage Generation	2 $	12 $	-	1.0M
gemini-3-pro-previewVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 2 $↑ 12 $ > 200K ↓ 4 $↑ 18 $ Cache (≤ 200K):Read 0.2 $ Cache (> 200K):Read 0.4 $			1.0M
gemini-3.1-flash-image-previewImage Generation	0.5 $	3 $	-	1.0M
gemini-3.1-flash-lite-previewVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 0.25 $↑ 1.5 $ > 200K ↓ 0.25 $↑ 1.5 $ Cache (≤ 200K):Read 0.03 $ Cache (> 200K):Read 0.03 $			1.0M
gemini-3.1-flash-tts-previewAudio Generation	1 $	20 $	-	8K
gemini-3.1-pro-previewVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 2 $↑ 12 $ > 200K ↓ 4 $↑ 18 $ Cache (≤ 200K):Read 0.2 $ Cache (> 200K):Read 0.4 $			1.0M
gemini-3.5-flashVision Capable	1.5 $	9 $	0.15 $	1.0M
imagen-4.0-fast-generate-001Image Generation Deprecated Aug 17, 2026	0.02 $	0.02 $	-	1.0M
imagen-4.0-generate-001Image Generation Deprecated Aug 17, 2026	0.04 $	0.04 $	-	1.0M
imagen-4.0-ultra-generate-001Image Generation Deprecated Aug 17, 2026	0.06 $	0.06 $	-	1.0M
veo-3.0-fast-generate-001Video Generation	0.15 $	0.15 $	0.15 $	1K
veo-3.0-generate-001Video Generation	0.4 $	0.4 $	0.4 $	-
veo-3.1-fast-generate-previewVideo Generation	0.15 $	0.15 $	-	-
veo-3.1-generate-previewVideo Generation	0.4 $	0.4 $	-	-
veo-3.1-lite-generate-previewVideo Generation	0.05 $	0.05 $	-	-

MiniMax


I2V-01-DirectorVideo Generation	0.43 $	0.43 $	-	-
image-01Image Generation	0.0035 $	0.0035 $	-	-
MiniMax-Hailuo-02Video Generation	0.56 $	0.56 $	-	-
MiniMax-Hailuo-2.3Video Generation	0.56 $	0.56 $	-	-
MiniMax-Hailuo-2.3-FastVideo Generation	0.32 $	0.32 $	-	-
minimax-m2	0.3 $	1.2 $	0.03 $	200K
minimax-m2.1	0.3 $	1.2 $	0.03 $	200K
minimax-m2.5	0.3 $	1.2 $	0.03 $	200K
minimax-m2.5-highspeed	0.3 $	3.6 $	0.03 $	200K
minimax-m2.7	0.3 $	1.2 $	0.06 $	205K
minimax-m2.7-highspeed	0.3 $	2.4 $	0.06 $	205K
minimax-m3Vision Capable	ContextIn/Out per 1M tok ≤ 512K ↓ 0.3 $↑ 1.2 $ > 512K ↓ 0.6 $↑ 2.4 $ Cache (≤ 512K):Read 0.06 $ Cache (> 512K):Read 0.12 $			1.0M
MiniMax-Text-01Vision Capable	0.2 $	1.1 $	-	1.0M
S2V-01Video Generation	0.65 $	0.65 $	-	-
T2V-01-DirectorVideo Generation	0.43 $	0.43 $	-	-

Mistral


codestral-latest	0.3 $	0.9 $	-	256K
mistral-large-latestVision Capable	0.5 $	1.5 $	-	256K
mistral-medium-latestVision Capable	1.5 $	7.5 $	-	256K

Moonshot AI


kimi-k2.5Vision Capable	0.6 $	3 $	0.1 $	262K
kimi-k2.6Vision Capable	0.95 $	4 $	0.16 $	262K
kimi-k2.7-codeVision Capable	0.95 $	4 $	0.19 $	262K
kimi-latestVision Capable	ContextIn/Out per 1M tok ≤ 8K ↓ 0.2 $↑ 2 $ 8K - 33K ↓ 1 $↑ 3 $ 33K - 131K ↓ 2 $↑ 5 $			131K

OpenAI


chat-latestVision Capable	5 $	30 $	0.5 $	1.0M
gpt-3.5-turbo	0.5 $	1.5 $	1.25 $	16K
gpt-4	30 $	60 $	-	130K
gpt-4-turboVision Capable	10 $	30 $	-	128K
gpt-4.1Vision Capable	2 $	8 $	0.5 $	1.0M
gpt-4.1-miniVision Capable	0.4 $	1.6 $	0.1 $	1.0M
gpt-4.1-nanoVision Capable	0.1 $	0.4 $	0.025 $	1.0M
gpt-4oVision Capable	2.5 $	10 $	1.25 $	128K
gpt-4o-miniVision Capable	0.15 $	0.6 $	0.075 $	128K
gpt-4o-mini-transcribeAudio Generation	1.25 $	5 $	-	16K
gpt-4o-mini-ttsAudio Generation	0.6 $	12 $	-	-
gpt-4o-transcribeAudio Generation	2.5 $	10 $	-	16K
gpt-4o-transcribe-diarizeAudio Generation	2.5 $	10 $	-	16K
gpt-5Vision Capable	1.25 $	10 $	0.125 $	400K
gpt-5-miniVision Capable	0.25 $	2 $	0.025 $	400K
gpt-5-nanoVision Capable	0.05 $	0.4 $	0.005 $	400K
gpt-5.2Vision Capable	1.75 $	14 $	0.175 $	400K
gpt-5.2-proVision Capable	21 $	168 $	-	400K
gpt-5.3-codex	1.75 $	14 $	0.175 $	200K
gpt-5.4Vision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 2.5 $↑ 15 $ > 272K ↓ 5 $↑ 22.5 $ Cache (≤ 272K):Read 0.25 $ Cache (> 272K):Read 0.5 $			1.1M
gpt-5.4-miniVision Capable	0.75 $	4.5 $	0.075 $	400K
gpt-5.4-nanoVision Capable	0.2 $	1.25 $	0.02 $	400K
gpt-5.4-proVision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 30 $↑ 180 $ > 272K ↓ 60 $↑ 270 $			1.1M
gpt-5.5Vision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 5 $↑ 30 $ > 272K ↓ 10 $↑ 45 $ Cache (≤ 272K):Read 0.5 $ Cache (> 272K):Read 1 $			1.1M
gpt-5.5-proVision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 30 $↑ 180 $ > 272K ↓ 60 $↑ 270 $			1.1M
gpt-5.6-lunaVision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 1 $↑ 6 $ > 272K ↓ 2 $↑ 9 $ Cache (≤ 272K):Read 0.1 $ / Write 1.25 $ Cache (> 272K):Read 0.2 $ / Write 2.5 $			1.1M
gpt-5.6-solVision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 5 $↑ 30 $ > 272K ↓ 10 $↑ 45 $ Cache (≤ 272K):Read 0.5 $ / Write 6.25 $ Cache (> 272K):Read 1 $ / Write 12.5 $			1.1M
gpt-5.6-terraVision Capable	ContextIn/Out per 1M tok ≤ 272K ↓ 2.5 $↑ 15 $ > 272K ↓ 5 $↑ 22.5 $ Cache (≤ 272K):Read 0.25 $ / Write 3.125 $ Cache (> 272K):Read 0.5 $ / Write 6.25 $			1.1M
gpt-image-1Image Generation	5 $	0 $	1.25 $	-
gpt-image-1-miniImage Generation	2.5 $	0 $	0.25 $	-
gpt-image-1.5Image Generation	5 $	0 $	1.25 $	-
gpt-image-2Image Generation	5 $	0 $	1.25 $	-
o1Vision Capable	15 $	60 $	7.5 $	200K
o1-proVision Capable	150 $	600 $	-	200K
o3Vision Capable	2 $	8 $	0.5 $	200K
o3-mini	1.1 $	4.4 $	0.55 $	200K
o4-miniVision Capable	1.1 $	4.4 $	0.275 $	200K
sora-2Video Generation Deprecated Sep 24, 2026	0.1 $	0.1 $	-	-
sora-2-proVideo Generation Deprecated Sep 24, 2026	0.5 $	0.5 $	-	-
whisper-1Audio Generation	0.006 $	0 $	-	-

X.ai


grok-4-0709Vision Capable	3 $	15 $	0.75 $	256K
grok-4-1-fast-non-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.2 $↑ 0.5 $ > 128K ↓ 0.4 $↑ 1 $ Cache: 0.05 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			2.0M
grok-4-1-fast-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.2 $↑ 0.5 $ > 128K ↓ 0.4 $↑ 1 $ Cache: 0.05 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			2.0M
grok-4-fastVision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.2 $↑ 0.5 $ > 128K ↓ 0.4 $↑ 1 $ Cache: 0.05 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			2.0M
grok-4-fast-non-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.2 $↑ 0.5 $ > 128K ↓ 0.4 $↑ 1 $ Cache: 0.05 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			2.0M
grok-4-fast-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 128K ↓ 0.2 $↑ 0.5 $ > 128K ↓ 0.4 $↑ 1 $ Cache: 0.05 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			2.0M
grok-4.20-0309-non-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1.25 $↑ 2.5 $ > 200K ↓ 2.5 $↑ 5 $ Cache: 0.2 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			1.0M
grok-4.20-0309-reasoningVision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1.25 $↑ 2.5 $ > 200K ↓ 2.5 $↑ 5 $ Cache: 0.2 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			1.0M
grok-4.3Vision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1.25 $↑ 2.5 $ > 200K ↓ 2.5 $↑ 5 $ Cache: 0.2 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			1.0M
grok-build-0.1Vision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1 $↑ 2 $ > 200K ↓ 2 $↑ 4 $ Cache (≤ 200K):Read 0.2 $ Cache (> 200K):Read 0.4 $ Cache: 0.2 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			256K
grok-code-fast-1Vision Capable	ContextIn/Out per 1M tok ≤ 200K ↓ 1 $↑ 2 $ > 200K ↓ 2 $↑ 4 $ Cache (≤ 200K):Read 0.2 $ Cache (> 200K):Read 0.4 $ Cache: 0.2 $ per million Tool Calls: x_search: $0.005, web_search: $0.005, code_execution: $0.005, attachment_search: $0.01, collections_search: $0.0025			256K
grok-imagine-imageImage Generation	0.002 $	0.02 $	-	-
grok-imagine-image-proImage Generation	0.002 $	0.07 $	-	-
grok-imagine-image-qualityImage Generation	0.05 $	0.05 $	-	-
grok-imagine-videoVideo Generation	0 $	0.05 $	-	-
grok-imagine-video-1.5-previewVideo Generation	0.08 $	0.08 $	-	-
grok-ttsAudio Generation	4.2 $	0 $	-	-

Z.ai


cogvideox-3Video Generation	0.2 $	0.2 $	0 $	-
cogview-4-250304Image Generation	0.01 $	0.01 $	-	-
glm-4.5-air	0.2 $	1.1 $	0.03 $	128K
glm-4.6	0.6 $	2.2 $	0.11 $	200K
glm-4.6vVision Capable	0.3 $	0.9 $	0.05 $	128K
glm-4.6v-flashVision Capable	0 $	0 $	-	128K
glm-4.6v-flashxVision Capable	0.04 $	0.4 $	0.004 $	128K
glm-4.7	0.6 $	2.2 $	0.11 $	200K
glm-4.7-flash	0 $	0 $	-	128K
glm-4.7-flashx	0.07 $	0.4 $	0.01 $	128K
glm-5	1 $	3.2 $	0.2 $	200K
glm-5-turbo	1.2 $	4 $	0.24 $	200K
glm-5.1	1.4 $	4.4 $	0.26 $	200K
glm-5.2	1.4 $	4.4 $	0.26 $	1.0M
glm-5v-turboVision Capable	1.2 $	4 $	0.24 $	200K
glm-asr-2512Audio Generation	0.03 $	0 $	-	-
glm-imageImage Generation	0.015 $	0.015 $	-	-
vidu2-imageVideo Generation	0.2 $	0.2 $	0 $	-
vidu2-referenceVideo Generation	0.4 $	0.4 $	0 $	-
vidu2-start-endVideo Generation	0.2 $	0.2 $	0 $	-
viduq1-imageVideo Generation	0.4 $	0.4 $	0 $	-
viduq1-start-endVideo Generation	0.4 $	0.4 $	0 $	-
viduq1-textVideo Generation	0.4 $	0.4 $	0 $	-