Large Model API Pricing
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Anthropic's Claude model offers advanced AI safety capabilities, focusing on useful, harmless, and honest AI assistants with powerful reasoning and conversational abilities.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| claude-opus-4-7 | - | 1,000,000 | $4.75 $5 | $5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr) | $0.475 $0.5 | $23.75 $25 | Go check it out |
| Claude, Op. 4, No. 6 | 1–200,000 | 1,000,000 | $5 | $6.25 (5 min) · $10 (1 hr) | $0.5 | $25 | Go check it out |
| 200,000–1,000,000 | 1,000,000 | $5 | $6.25 (5 min) · $10 (1 hr) | $0.5 | $25 | Go check it out | |
| claude-opus-4-6-dd | - | 1,000,000 | $2.75$5 | $3.4375(5m)·$5.5(1h)$6.25(5m)·$10(1h) | $0.275$0.5 | $13.75$25 | Go check it out |
| claude-sonnet-4-6 | 1–200,000 | 1,000,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out |
| 200,000–1,000,000 | 1,000,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out | |
| claude-sonnet-4-6-dd | - | 1,000,000 | $1.65$3 | $2.0625(5m)·$3.3(1h)$3.75(5m)·$6(1h) | $0.165$0.3 | $8.25$15 | Go check it out |
| claude-opus-4-5-20251101 | - | 200,000 | $4.75 $5 | $5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr) | $0.475 $0.5 | $23.75 $25 | Go check it out |
| claude-opus-4-5-20251101-dd | - | 200,000 | $2.75$5 | $3.4375(5m)$6.25(5m) | $0.275$0.5 | $13.75$25 | Go check it out |
| claude-sonnet-4-5-20250929 | 1–200,000 | 200,000 | $3 | $3.75 (5 min) · $6 (1 hr) | $0.3 | $15 | Go check it out |
| 200,000–1,000,000 | 200,000 | $6 | $7.5 (5 min) × $12 (1 hr) | $0.6 | $22.5 | Go check it out | |
| claude-sonnet-4-5-20250929-dd | - | 200,000 | $1.65$3 | $2.0625(5m)$3.75(5m) | $0.165$0.3 | $8.25$15 | Go check it out |
| claude-haiku-4-5-20251001 | - | 20,000 | $1 | $1.25 (5 min) · $2 (1 hr) | $0.1 | $5 | Go check it out |
| claude-haiku-4-5-20251001-dd | - | 200,000 | $0.55$1 | $0.6875(5m)·$1.1(1h)$1.25(5m)·$2(1h) | $0.055$0.1 | $2.75$5 | Go check it out |
| claude-sonnet-4-20250514 | - | 200,000 | $2.85 $3 | $3.5625 (5m) $3.75 (5-month) | $0.285 $0.3 | $14.25 $15 | Go check it out |
OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|
| gpt-5.5 | 1–272,000 | 1,050,000 | $5 | $0.5 | $30 | Go check it out |
| 272,000–1,050,000 | 1,050,000 | $10 | $1 | $45 | Go check it out | |
| gpt-5.5-pro | 1–272,000 | 1,050,000 | $30 | - | $180 | Go check it out |
| 272,000–1,050,000 | 1,050,000 | $60 | - | $270 | Go check it out | |
| gpt-5.5-light | 1–272,000 | 1,050,000 | $0.25$5 | $0.025$0.5 | $1.5$30 | Go check it out |
| 272,000–1,050,000 | 1,050,000 | $0.5$10 | $0.05$1 | $2.25$45 | Go check it out | |
| gpt-5.4-nano | - | 400,000 | $0.19 $0.2 | $0.019 $0.02 | $1.1875 $1.25 | Go check it out |
| gpt-5.4-mini | - | 400,000 | $0.7125 $0.75 | $0.0712 $0.075 | $4.275 $4.5 | Go check it out |
| gpt-5.4-pro | 1–272,000 | 1,050,000 | $30 | - | $180 | Go check it out |
| 272,000–1,050,000 | 1,050,000 | $60 | - | $270 | Go check it out | |
| GPT-5.4 | 1–272,000 | 1,050,000 | $2.5 | $0.25 | $15 | Go check it out |
| 272,000–1,050,000 | 1,050,000 | $5 | $0.5 | $22.5 | Go check it out | |
| gpt-5.3-chat-latest | - | 128,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.3-codex | - | 400,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.2-codex | - | 400,000 | $1.75 | $0.175 | $14 | Go check it out |
| GPT-5.2 | - | 400,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.2-pro | - | 400,000 | $19.95 $21 | - | $159.6 $168 | Go check it out |
| gpt-5.2-chat-latest | - | 128,000 | $1.6625 $1.75 | $0.1662 $0.175 | $13.3 $14 | Go check it out |
| gpt-5.1-codex-max | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1-codex-mini | - | 400,000 | $0.2375 $0.25 | $0.0237 $0.025 | $1.9 $2 | Go check it out |
| gpt-5.1-codex | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1-chat-latest | - | 128,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5.1 | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-pro | - | 400,000 | $14.25 $15 | - | $114 $120 | Go check it out |
| gpt-5-codex | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-chat-latest | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gpt-5-nano | - | 400,000 | $0.0475 $0.05 | $0.0047 $0.005 | $0.38 $0.4 | Go check it out |
| gpt-5-mini | - | 400,000 | $0.2375 $0.25 | $0.0237 $0.025 | $1.9 $2 | Go check it out |
| GPT-5 | - | 400,000 | $1.1875 $1.25 | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| OpenAI: GPT OSS 20B | - | 131,072 | $0.05 | - | $0.2 | Go check it out |
| OpenAI GPT OSS 120B | - | 131,072 | $0.1 | - | $0.5 | Go check it out |
| gpt-4.1-mini | - | 1,047,576 | $0.4 | $0.1 | $1.6 | Go check it out |
| gpt-4.1-nano | - | 1,047,576 | $0.1 | $0.025 | $0.4 | Go check it out |
| gpt-4.1 | - | 1,047,576 | $2 | $0.5 | $8 | Go check it out |
| GPT-4o-mini | - | 128,000 | $0.1425 $0.15 | $0.0712 $0.075 | $0.57 $0.6 | Go check it out |
| GPT-4O | - | 131,072 | $2.375 $2.5 | $1.1875 $1.25 | $9.5 $10 | Go check it out |
Google's Gemini model offers high-quality natural language processing capabilities, performs exceptionally well across a wide range of NLP tasks, and boasts powerful multimodal capabilities.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| gemini-3.1-pro-preview | 1–204,800 | 1,048,576 | $2 | $0.375 (5 min) × $4.5 (1 hr) | $0.2 | $12 | Go check it out |
| 204,800–1,048,576 | 1,048,576 | $4 | $0.375 (5 min) × $4.5 (1 hr) | $0.4 | $18 | Go check it out | |
| gemini-3.1-flash-lite-preview | - | 1,048,576 | $0.2375 $0.25 | $0.0791 (5 min) × $0.95 (1 hr) $0.0833 (5 min) × $1 (1 hr) | $0.0237 $0.025 | $1.425 $1.5 | Go check it out |
| gemini-3-flash-preview | - | 1,048,576 | $0.475 $0.5 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0475 $0.05 | $2.85 $3 | Go check it out |
| gemini-2.5-flash-lite-preview-09-2025 | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-flash-lite | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-pro | - | 1,048,576 | $1.1875 $1.25 | $0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr) | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gemini-2.5-flash | - | 1,048,576 | $0.285 $0.3 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0285 $0.03 | $2.375 $2.5 | Go check it out |
| gemini-2.5-flash-lite-preview-06-17 | - | 1,048,576 | $0.095 $0.1 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0095 $0.01 | $0.38 $0.4 | Go check it out |
| gemini-2.5-flash-preview-05-20 | - | 1,048,576 | $0.1425 $0.15 | $0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr) | $0.0285 $0.03 | $3.325 $3.5 | Go check it out |
| gemini-2.5-pro-preview-06-05 | - | 1,048,576 | $1.1875 $1.25 | $0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr) | $0.1187 $0.125 | $9.5 $10 | Go check it out |
| gemini-2.0-flash-lite | - | 1,048,576 | $0.0712 $0.075 | - | - | $0.285 $0.3 | Go check it out |
| gemini-2.0-flash-20250609 | - | 1,048,576 | $0.1425 $0.15 | - | - | $0.57 $0.6 | Go check it out |
| Gemma3 12B | - | 131,072 | $0.05 | - | - | $0.1 | Go check it out |
| Gemma 3 27B | - | 32,768 | $0.119 | - | - | $0.2 | Go check it out |
| gemini-3.5-flash | - | 1,048,576 | $1.5 | $0.083(5m)·$1(1h) | $0.15 | $9 | Go check it out |
Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Llama 4 Maverick Instructions | 1,048,576 | $0.17 | $0.85 | Go check it out |
| Llama 4 Scout Instructor | 131,072 | $0.1 | $0.5 | Go check it out |
| Llama 3.3 70B Instruct | 131,072 | $0.13 | $0.39 | Go check it out |
| Llama 3.2 3B Instruct | 32,768 | $0.03 | $0.05 | Go check it out |
| Llama 3.1 8B Instruct | 16,384 | $0.02 | $0.05 | Go check it out |
The Qwen series of models offers powerful natural language processing capabilities and is available in a range of parameter sizes, from lightweight to enterprise-grade solutions.
Baidu's ERNIE model offers advanced Chinese language understanding and multimodal capabilities, is optimized for Chinese applications, and is competitively priced.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| ERNIE 4.5 VL 424B A47B | 123,000 | $0.42 | $1.25 | Go check it out |
| ERNIE 4.5 300B A47B | 123,000 | $0.28 | $1.1 | Go check it out |
The GLM series of models from Tsinghua University feature advanced Chinese language understanding and generation capabilities.
| Model Name | Context | Input (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|
| GLM-5.1 | 204,800 | $1.38 | $0.26 | $4.4 | Go check it out |
| GLM-5V-Turbo | 204,800 | $1.2 | $0.24 | $4 | Go check it out |
| GLM-5-Turbo | 202,800 | $1.2 | $0.24 | $4 | Go check it out |
| GLM-5 | 204,800 | $1 | $0.2 | $3.2 | Go check it out |
| GLM-OCR | 32,000 | $0.03 | - | $0.03 | Go check it out |
| GLM-4.7-Flash | 200,000 | $0.07 | $0.01 | $0.4 | Go check it out |
| GLM-4.7 | 204,800 | $0.6 | - | $2.2 | Go check it out |
| GLM 4.5V | 65,536 | $0.6 | - | $1.8 | Go check it out |
| GLM-4.5 | 131,072 | $0.6 | - | $2.2 | Go check it out |
A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| L3 8B Stheno V3.2 | 8,192 | $0.05 | $0.05 | Go check it out |
| Sao10k L3 8B Lunaris | 8,192 | $0.05 | $0.05 | Go check it out |
| L31 70B Euryale V2.2 | 8,192 | $1.48 | $1.48 | Go check it out |
| L3 70B Euryale V2.1 | 8,192 | $1.48 | $1.48 | Go check it out |
A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mistral Nemo | 60,288 | $0.04 | $0.17 | Go check it out |
| Mistral 7B Instruct | 32,768 | $0.029 | $0.059 | Go check it out |
Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.
| Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|---|
| Deepseek V4 Flash | 1,048,576 | $0.14 | - | $0.028 | $0.28 | Go check it out |
| Deepseek V4 Pro | 1,048,576 | $1.74 | - | $0.145 | $3.48 | Go check it out |
| DeepSeek-OCR 2 | 8,192 | $0.03 | - | - | $0.03 | Go check it out |
| DeepSeek V3.1 | 163,840 | $0.27 | - | - | $1 | Go check it out |
| DeepSeek R1 0528 | 163,840 | $0.7 | - | $0.35 | $2.5 | Go check it out |
| DeepSeek V3 0324 | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14 | Go check it out |
MiniMax AI's advanced language model delivers powerful conversational AI capabilities, excelling in customer service, content generation, and creative applications, with robust multilingual support and enterprise-grade scalability.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| MiniMax M1 | 1,000,000 | $0.55 | $2.2 | Go check it out |
An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mythomax L2 13B | 4,096 | $0.09 | $0.09 | Go check it out |
A sophisticated collection of state-of-the-art AI models, featuring advanced reasoning and mathematical proof capabilities, as well as cutting-edge language understanding across multiple domains.