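Anthropic's Claude models offer advanced, safety-focused AI capabilities, centered on a helpful, harmless, and honest assistant experience, with strong reasoning and conversational abilities.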
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| claude-opus-4-8 | - | 1,000,000 | $5 | $6.25 (5 min) · $10 (1 hr) | $0.5 | $25 | Go check it out |
| claude-opus-4-8-r | - | 1,000,000 | $1 $5 | $1.25 (5 min) · $2 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.1 $0.5 | $5 $25 | Go check it out |
| claude-opus-4-7 | - | 1,000,000 | $4.75 $5 | $5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.475 $0.5 | $23.75 $25 | Go check it out |
| claude-opus-4-7-r | - | 1,000,000 | $1 $5 | $1.25 (5 min) · $2 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.1 $0.5 | $5 $25 | Go check it out |
| claude-opus-4-6 | 1–200,000 | 1,000,000 | $5 | $6.25 (5 min) · $10 (1 hr) | $0.5 | $25 | Go check it out |
| | 200,000–1,000,000 | 1,000,000 | $5 | $6.25 (5 min) · $10 (1 hr) | $0.5 | $25 | Go check it out |
| claude-opus-4-6-r | - | 1,000,000 | $1 $5 | $1.25 (5 min) · $2 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.1 $0.5 | $5 $25 | Go check it out |
| claude-opus-4-6-dd | - | 1,000,000 | $2.75 $5 | $3.4375 (5 min) · $5.5 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.275 $0.5 | $13.75 $25 | Go check it out |
| claude-sonnet-4-6 | 1–200,000 | 1,000,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out |
| | 200,000–1,000,000 | 1,000,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out |
| claude-sonnet-4-6-r | - | 1,000,000 | $0.6 $3 | $0.75 (5 min) · $1.2 (1 hr) $3.75 (5 min) · $6 (1 hr) | $0.06 $0.3 | $3 $15 | Go check it out |
| claude-sonnet-4-6-dd | - | 1,000,000 | $1.65 $3 | $2.0625 (5 min) · $3.3 (1 hr) $3.75 (5 min) · $6 (1 hr) | $0.165 $0.3 | $8.25 $15 | Go check it out |
| claude-opus-4-5-20251101 | - | 200,000 | $4.75 $5 | $5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) · $10 (1 hr) | $0.475 $0.5 | $23.75 $25 | Go check it out |
| claude-opus-4-5-20251101-dd | - | 200,000 | $2.75 $5 | $3.4375 (5 min) $6.25 (5 min) | $0.275 $0.5 | $13.75 $25 | Go check it out |
| claude-sonnet-4-5-20250929 | 1–200,000 | 200,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out |
| | 200,000–1,000,000 | 200,000 | $6 | $7.5 (5 min) · $12 (1 hr) | $0.6 | $22.5 | Go check it out |
| claude-sonnet-4-5-20250929-dd | - | 200,000 | $1.65 $3 | $2.0625 (5 min) $3.75 (5 min) | $0.165 $0.3 | $8.25 $15 | Go check it out |
| claude-haiku-4-5-20251001 | - | 200,000 | $1 | $1.25 (5 min) · $2 (1 hr) | $0.1 | $5 | Go check it out |
| claude-haiku-4-5-20251001-r | - | 200,000 | $0.2 $1 | $0.25 (5 min) · $0.4 (1 hr) $1.25 (5 min) · $2 (1 hr) | $0.02 $0.1 | $1 $5 | Go check it out |
| claude-haiku-4-5-20251001-dd | - | 200,000 | $0.55 $1 | $0.6875 (5 min) · $1.1 (1 hr) $1.25 (5 min) · $2 (1 hr) | $0.055 $0.1 | $2.75 $5 | Go check it out |
| claude-sonnet-4-20250514 | - | 200,000 | $2.85 $3 | $3.5625 (5 min) $3.75 (5 min) | $0.285 $0.3 | $14.25 $15 | Go check it out |
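
The per-million-token ("/Mt") rates in the table above can be turned into a per-request estimate with simple arithmetic. The sketch below is illustrative only: the `Rates` class and `request_cost` function are hypothetical names, and it assumes that uncached input, cache writes, cache reads, and output tokens are each billed separately at their own column's rate. The rates are copied from the `claude-sonnet-4-6` row, using the 5-minute cache-write price.

```python
from dataclasses import dataclass

TOKENS_PER_MT = 1_000_000  # "/Mt" in the tables means "per million tokens"


@dataclass(frozen=True)
class Rates:
    """Per-million-token prices in USD, copied from one table row (hypothetical helper)."""
    input: float
    cache_write: float
    cache_read: float
    output: float


def request_cost(rates, input_tokens, output_tokens,
                 cache_write_tokens=0, cache_read_tokens=0):
    """Estimate the USD cost of one request.

    Assumption: each token category is billed independently at its own rate.
    """
    total = (
        input_tokens * rates.input
        + cache_write_tokens * rates.cache_write
        + cache_read_tokens * rates.cache_read
        + output_tokens * rates.output
    )
    return total / TOKENS_PER_MT


# claude-sonnet-4-6 row: $3 input, $3.75 cache write (5 min), $0.3 cache read, $15 output
SONNET_4_6 = Rates(input=3.0, cache_write=3.75, cache_read=0.30, output=15.0)

cost = request_cost(
    SONNET_4_6,
    input_tokens=2_000,        # uncached prompt tokens
    output_tokens=1_000,
    cache_read_tokens=50_000,  # prompt tokens served from cache
)
print(f"${cost:.4f}")
```

Under these assumptions the example request comes to roughly $0.036: the 50,000 cached tokens cost about as much as the 1,000 output tokens, which is why the cache-read column matters for long, repeated prompts.
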
OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|
| gpt-5.5 | 1–272,000 | 1,050,000 | $5 | $0.5 | $30 | Go check it out |
| | 272,000–1,050,000 | 1,050,000 | $10 | $1 | $45 | Go check it out |
| gpt-5.5-pro | 1–272,000 | 1,050,000 | $30 | - | $180 | Go check it out |
| | 272,000–1,050,000 | 1,050,000 | $60 | - | $270 | Go check it out |
| gpt-5.5-r | - | 1,050,000 | $0.25 $5 | $0.025 $0.5 | $1.5 $30 | Go check it out |
| gpt-5.5-light | 1–272,000 | 1,050,000 | $0.25 $5 | $0.025 $0.5 | $1.5 $30 | Go check it out |
| | 272,000–1,050,000 | 1,050,000 | $0.5 $10 | $0.05 $1 | $2.25 $45 | Go check it out |
| gpt-5.4-nano | - | 400,000 | $0.19 $0.2 | $0.019 $0.02 | $1.1875 $1.25 | Go check it out |
| gpt-5.4-mini | - | 400,000 | $0.7125 $0.75 | $0.0712 $0.075 | $4.275 $4.5 | Go check it out |
| gpt-5.4-pro | 1–272,000 | 1,050,000 | $30 | - | $180 | Go check it out |
| | 272,000–1,050,000 | 1,050,000 | $60 | - | $270 | Go check it out |
| GPT-5.4 | 1–272,000 | 1,050,000 | $2.5 | $0.25 | $15 | Go check it out |
| | 272,000–1,050,000 | 1,050,000 | $5 | $0.5 | $22.5 | Go check it out |
| gpt-5.3-chat-latest | - | 128,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.3-codex | - | 400,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.2-codex | - | 400,000 | $1.75 | $0.175 | $14 | Go check it out |
| GPT-5.2 | - | 400,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.2-pro | - | 400,000 | $19.95 $21 | - | $159.6 $168 | Go check it out |
| gpt-5.2-chat-latest | - | 128,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.1-codex-max | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1-codex-mini | - | 400,000 | $0.2375 $0.25 | $0.0237 $0.025 | $1.9 $2 | Go check it out |
| gpt-5.1-codex | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1-chat-latest | - | 128,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1 | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-pro | - | 400,000 | $14.25 $15 | - | $114 $120 | Go check it out |
| gpt-5-codex | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-chat-latest | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-nano | - | 400,000 | $0.0475 $0.05 | $0.0047 $0.005 | $0.38 $0.4 | Go check it out |
| gpt-5-mini | - | 400,000 | $0.2375 $0.25 | $0.0237 $0.025 | $1.9 $2 | Go check it out |
| GPT-5 | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| OpenAI GPT OSS 20B | - | 131,072 | $0.05 | - | $0.2 | Go check it out |
| OpenAI GPT OSS 120B | - | 131,072 | $0.1 | - | $0.5 | Go check it out |
| gpt-4.1-mini | - | 1,047,576 | $0.4 | $0.1 | $1.6 | Go check it out |
| gpt-4.1-nano | - | 1,047,576 | $0.1 | $0.025 | $0.4 | Go check it out |
| gpt-4.1 | - | 1,047,576 | $2 | $0.5 | $8 | Go check it out |
| GPT-4o-mini | - | 128,000 | $0.1425 $0.15 | $0.0712 $0.075 | $0.57 $0.6 | Go check it out |
| GPT-4o | - | 131,072 | $2.375 $2.5 | $1.1875 $1.25 | $9.5 $10 | Go check it out |
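
Several models in the table above list two input token ranges with different rates. A minimal sketch of how such tiers might be applied follows, using the two `gpt-5.5` rows. It rests on two assumptions that the table does not state: the whole request is billed at the rate of the tier its input token count falls into, and a request of exactly 272,000 input tokens belongs to the lower tier. `GPT_5_5_TIERS` and `tiered_cost` are hypothetical names used only for illustration.

```python
from bisect import bisect_left

# (upper bound of input-token range, input $/Mt, output $/Mt), from the gpt-5.5 rows
GPT_5_5_TIERS = [
    (272_000, 5.0, 30.0),
    (1_050_000, 10.0, 45.0),
]


def tiered_cost(tiers, input_tokens, output_tokens):
    """Estimate USD cost, picking the tier by input token count.

    Assumption: the selected tier's rates apply to the entire request.
    """
    bounds = [upper for upper, _, _ in tiers]
    # First tier whose upper bound is >= input_tokens (boundary goes to the lower tier).
    index = bisect_left(bounds, input_tokens)
    if index == len(tiers):
        raise ValueError("input exceeds the largest listed token range")
    _, input_rate, output_rate = tiers[index]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


short_prompt = tiered_cost(GPT_5_5_TIERS, input_tokens=100_000, output_tokens=2_000)
long_prompt = tiered_cost(GPT_5_5_TIERS, input_tokens=300_000, output_tokens=2_000)
print(f"${short_prompt:.2f} vs ${long_prompt:.2f}")
```

With these assumptions, tripling the prompt from 100,000 to 300,000 tokens raises the estimate from about $0.56 to about $3.09, because crossing the 272,000-token boundary doubles the input rate and raises the output rate.
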
Google's Gemini models offer high-quality language processing, perform well across a range of NLP tasks, and have strong multimodal capabilities.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| gemini-3.1-pro-preview | 1–204,800 | 1,048,576 | $2 | $0.375 (5 min) · $4.5 (1 hr) | $0.2 | $12 | Go check it out |
| | 204,800–1,048,576 | 1,048,576 | $4 | $0.375 (5 min) · $4.5 (1 hr) | $0.4 | $18 | Go check it out |
| gemini-3.1-flash-lite-preview | - | 1,048,576 | $0.2375 $0.25 | $0.0791 (5 min) · $0.95 (1 hr) $0.0833 (5 min) · $1 (1 hr) | $0.0237 $0.025 | $1.425 $1.5 | Go check it out |
| gemini-3-flash-preview | - | 1,048,576 | $0.475 $0.5 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0475 $0.05 | $2.85 $3 | Go check it out |
| gemini-2.5-flash-lite-preview-09-2025 | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-flash-lite | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-pro | - | 1,048,576 | $1.1875 $1.25 | $0.3562 (5 min) · $4.275 (1 hr) $0.375 (5 min) · $4.5 (1 hr) | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gemini-2.5-flash | - | 1,048,576 | $0.285 $0.3 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0285 $0.03 | $2.375 $2.5 | Go check it out |
| gemini-2.5-flash-lite-preview-06-17 | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-flash-preview-05-20 | - | 1,048,576 | $0.1425 $0.15 | $0.0788 (5 min) · $0.95 (1 hr) $0.083 (5 min) · $1 (1 hr) | $0.0285 $0.03 | $3.325 $3.5 | Go check it out |
| gemini-2.5-pro-preview-06-05 | - | 1,048,576 | $1.1875 $1.25 | $0.3562 (5 min) · $4.275 (1 hr) $0.375 (5 min) · $4.5 (1 hr) | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gemini-2.0-flash-lite | - | 1,048,576 | $0.0712 $0.075 | - | - | $0.285 $0.3 | Go check it out |
| gemini-2.0-flash-20250609 | - | 1,048,576 | $0.1425 $0.15 | - | - | $0.57 $0.6 | Go check it out |
| Gemma3 12B | - | 131,072 | $0.05 | - | - | $0.1 | Go check it out |
| Gemma 3 27B | - | 32,768 | $0.119 | - | - | $0.2 | Go check it out |
| gemini-3.5-flash | - | 1,048,576 | $1.5 | $0.083 (5 min) · $1 (1 hr) | $0.15 | $9 | Go check it out |
Meta's Llama models offer state-of-the-art language understanding and an open architecture, making them suitable for a wide range of applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Llama 4 Maverick Instruct | 1,048,576 | $0.17 | $0.85 | Go check it out |
| Llama 4 Scout Instruct | 131,072 | $0.1 | $0.5 | Go check it out |
| Llama 3.3 70B Instruct | 131,072 | $0.13 | $0.39 | Go check it out |
| Llama 3.2 3B Instruct | 32,768 | $0.03 | $0.05 | Go check it out |
| Llama 3.1 8B Instruct | 16,384 | $0.02 | $0.05 | Go check it out |
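The Qwen series offers efficient language processing across a range of parameter sizes, with options from lightweight to enterprise-grade.
Baidu's ERNIE models offer advanced Chinese language understanding and multimodal capabilities, are optimized for Chinese-language applications, and are competitively priced.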
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| ERNIE 4.5 VL 424B A47B | 123,000 | $0.42 | $1.25 | Go check it out |
| ERNIE 4.5 300B A47B | 123,000 | $0.28 | $1.1 | Go check it out |
The GLM series of models, originating from Tsinghua University, offers advanced Chinese language understanding and generation.
| Model Name | Context | Input (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|
| GLM-5.1 | 204,800 | $1.38 | $0.26 | $4.4 | Go check it out |
| GLM-5V-Turbo | 204,800 | $1.2 | $0.24 | $4 | Go check it out |
| GLM-5-Turbo | 202,800 | $1.2 | $0.24 | $4 | Go check it out |
| GLM-5 | 204,800 | $1 | $0.2 | $3.2 | Go check it out |
| GLM-OCR | 32,000 | $0.03 | - | $0.03 | Go check it out |
| GLM-4.7-Flash | 200,000 | $0.07 | $0.01 | $0.4 | Go check it out |
| GLM-4.7 | 204,800 | $0.6 | - | $2.2 | Go check it out |
| GLM-4.5V | 65,536 | $0.6 | - | $1.8 | Go check it out |
| GLM-4.5 | 131,072 | $0.6 | - | $2.2 | Go check it out |
A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| L3 8B Stheno V3.2 | 8,192 | $0.05 | $0.05 | Go check it out |
| Sao10k L3 8B Lunaris | 8,192 | $0.05 | $0.05 | Go check it out |
| L3.1 70B Euryale V2.2 | 8,192 | $1.48 | $1.48 | Go check it out |
| L3 70B Euryale V2.1 | 8,192 | $1.48 | $1.48 | Go check it out |
A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mistral Nemo | 60,288 | $0.04 | $0.17 | Go check it out |
| Mistral 7B Instruct | 32,768 | $0.029 | $0.059 | Go check it out |
Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.
| Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|---|
| DeepSeek V4 Flash | 1,048,576 | $0.14 | - | $0.028 | $0.28 | Go check it out |
| DeepSeek V4 Pro | 1,048,576 | $1.74 | - | $0.145 | $3.48 | Go check it out |
| DeepSeek-OCR 2 | 8,192 | $0.03 | - | - | $0.03 | Go check it out |
| DeepSeek V3.1 | 163,840 | $0.27 | - | - | $1 | Go check it out |
| DeepSeek R1 0528 | 163,840 | $0.7 | - | $0.35 | $2.5 | Go check it out |
| DeepSeek V3 0324 | 163,840 | $0.28 | $0.14 (5 min) | $0.14 | $1.14 | Go check it out |
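MiniMax AI's advanced language models offer strong conversational AI capabilities and perform well in customer service, content generation, and creative applications, with robust multilingual support and enterprise-grade scalability.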
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| MiniMax M1 | 1,000,000 | $0.55 | $2.2 | Go check it out |
An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mythomax L2 13B | 4,096 | $0.09 | $0.09 | Go check it out |
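A curated collection of state-of-the-art AI models with advanced reasoning, mathematical proof capabilities, and frontier language understanding across multiple domains.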