Large Model API Pricing

Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.

Anthropic logo

Anthropic

Anthropic's Claude model offers advanced AI safety capabilities, focusing on useful, harmless, and honest AI assistants with powerful reasoning and conversational abilities.

Model NameInput Token RangeContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Actions
claude-opus-4-7-1,000,000
$4.75 $5
$5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr)
$0.475 $0.5
$23.75 $25
Go check it out
Claude, Op. 4, No. 61–200,0001,000,000$5$6.25 (5 min) · $10 (1 hr)$0.5$25Go check it out
200,000–1,000,0001,000,000$5$6.25 (5 min) · $10 (1 hr)$0.5$25Go check it out
claude-opus-4-6-dd-1,000,000
$2.75$5
$3.4375(5m)·$5.5(1h)$6.25(5m)·$10(1h)
$0.275$0.5
$13.75$25
Go check it out
claude-sonnet-4-61–200,0001,000,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
200,000–1,000,0001,000,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
claude-sonnet-4-6-dd-1,000,000
$1.65$3
$2.0625(5m)·$3.3(1h)$3.75(5m)·$6(1h)
$0.165$0.3
$8.25$15
Go check it out
claude-opus-4-5-20251101-200,000
$4.75 $5
$5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr)
$0.475 $0.5
$23.75 $25
Go check it out
claude-opus-4-5-20251101-dd-200,000
$2.75$5
$3.4375(5m)$6.25(5m)
$0.275$0.5
$13.75$25
Go check it out
claude-sonnet-4-5-202509291–200,000200,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
200,000–1,000,000200,000$6$7.5 (5 min) × $12 (1 hr)$0.6$22.5Go check it out
claude-sonnet-4-5-20250929-dd-200,000
$1.65$3
$2.0625(5m)$3.75(5m)
$0.165$0.3
$8.25$15
Go check it out
claude-haiku-4-5-20251001-20,000$1$1.25 (5 min) · $2 (1 hr)$0.1$5Go check it out
claude-haiku-4-5-20251001-dd-200,000
$0.55$1
$0.6875(5m)·$1.1(1h)$1.25(5m)·$2(1h)
$0.055$0.1
$2.75$5
Go check it out
claude-sonnet-4-20250514-200,000
$2.85 $3
$3.5625 (5m) $3.75 (5-month)
$0.285 $0.3
$14.25 $15
Go check it out
OpenAI

OpenAI

OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.

Model NameInput Token RangeContextInput (/Mt)Cache read (/Mt)Output (/Mt)Actions
gpt-5.51–272,0001,050,000$5$0.5$30Go check it out
272,000–1,050,0001,050,000$10$1$45Go check it out
gpt-5.5-pro1–272,0001,050,000$30-$180Go check it out
272,000–1,050,0001,050,000$60-$270Go check it out
gpt-5.5-light1–272,0001,050,000
$0.25$5
$0.025$0.5
$1.5$30
Go check it out
272,000–1,050,0001,050,000
$0.5$10
$0.05$1
$2.25$45
Go check it out
gpt-5.4-nano-400,000
$0.19 $0.2
$0.019 $0.02
$1.1875 $1.25
Go check it out
gpt-5.4-mini-400,000
$0.7125 $0.75
$0.0712 $0.075
$4.275 $4.5
Go check it out
gpt-5.4-pro1–272,0001,050,000$30-$180Go check it out
272,000–1,050,0001,050,000$60-$270Go check it out
GPT-5.41–272,0001,050,000$2.5$0.25$15Go check it out
272,000–1,050,0001,050,000$5$0.5$22.5Go check it out
gpt-5.3-chat-latest-128,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.3-codex-400,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.2-codex-400,000$1.75$0.175$14Go check it out
GPT-5.2-400,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.2-pro-400,000
$19.95 $21
-
$159.6 $168
Go check it out
gpt-5.2-chat-latest-128,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.1-codex-max-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5.1-codex-mini-400,000
$0.2375 $0.25
$0.0237 $0.025
$1.9 $2
Go check it out
gpt-5.1-codex-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5.1-chat-latest-128,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5.1-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5-pro-400,000
$14.25 $15
-
$114 $120
Go check it out
gpt-5-codex-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5-chat-latest-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5-nano-400,000
$0.0475 $0.05
$0.0047 $0.005
$0.38 $0.4
Go check it out
gpt-5-mini-400,000
$0.2375 $0.25
$0.0237 $0.025
$1.9 $2
Go check it out
GPT-5-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
OpenAI: GPT OSS 20B-131,072$0.05-$0.2Go check it out
OpenAI GPT OSS 120B-131,072$0.1-$0.5Go check it out
gpt-4.1-mini-1,047,576$0.4$0.1$1.6Go check it out
gpt-4.1-nano-1,047,576$0.1$0.025$0.4Go check it out
gpt-4.1-1,047,576$2$0.5$8Go check it out
GPT-4o-mini-128,000
$0.1425 $0.15
$0.0712 $0.075
$0.57 $0.6
Go check it out
GPT-4O-131,072
$2.375 $2.5
$1.1875 $1.25
$9.5 $10
Go check it out
Gemini logo

Gemini

Google's Gemini model offers high-quality natural language processing capabilities, performs exceptionally well across a wide range of NLP tasks, and boasts powerful multimodal capabilities.

Model NameInput Token RangeContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Actions
gemini-3.1-pro-preview1–204,8001,048,576$2$0.375 (5 min) × $4.5 (1 hr)$0.2$12Go check it out
204,800–1,048,5761,048,576$4$0.375 (5 min) × $4.5 (1 hr)$0.4$18Go check it out
gemini-3.1-flash-lite-preview-1,048,576
$0.2375 $0.25
$0.0791 (5 min) × $0.95 (1 hr) $0.0833 (5 min) × $1 (1 hr)
$0.0237 $0.025
$1.425 $1.5
Go check it out
gemini-3-flash-preview-1,048,576
$0.475 $0.5
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0475 $0.05
$2.85 $3
Go check it out
gemini-2.5-flash-lite-preview-09-2025-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.5-flash-lite-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.5-pro-1,048,576
$1.1875 $1.25
$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)
$0.1187 $0.125
$9.5 $10
Go check it out
gemini-2.5-flash-1,048,576
$0.285 $0.3
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0285 $0.03
$2.375 $2.5
Go check it out
gemini-2.5-flash-lite-preview-06-17-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.5-flash-preview-05-20-1,048,576
$0.1425 $0.15
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0285 $0.03
$3.325 $3.5
Go check it out
gemini-2.5-pro-preview-06-05-1,048,576
$1.1875 $1.25
$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)
$0.1187 $0.125
$9.5 $10
Go check it out
gemini-2.0-flash-lite-1,048,576
$0.0712 $0.075
--
$0.285 $0.3
Go check it out
gemini-2.0-flash-20250609-1,048,576
$0.1425 $0.15
--
$0.57 $0.6
Go check it out
Gemma3 12B-131,072$0.05--$0.1Go check it out
Gemma 3 27B-32,768$0.119--$0.2Go check it out
gemini-3.5-flash-1,048,576$1.5$0.083(5m)·$1(1h)$0.15$9Go check it out
Llama logo

Llama

Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.

Model NameContextInput (/Mt)Output (/Mt)Operation
Llama 4 Maverick Instructions1,048,576$0.17$0.85Go check it out
Llama 4 Scout Instructor131,072$0.1$0.5Go check it out
Llama 3.3 70B Instruct131,072$0.13$0.39Go check it out
Llama 3.2 3B Instruct32,768$0.03$0.05Go check it out
Llama 3.1 8B Instruct16,384$0.02$0.05Go check it out
Qwen logo

Qwen

The Qwen series of models offers powerful natural language processing capabilities and is available in a range of parameter sizes, from lightweight to enterprise-grade solutions.

Wenxin

Baidu

Baidu's ERNIE model offers advanced Chinese language understanding and multimodal capabilities, is optimized for Chinese applications, and is competitively priced.

Model NameContextInput (/Mt)Output (/Mt)Operation
ERNIE 4.5 VL 424B A47B123,000$0.42$1.25Go check it out
ERNIE 4.5 300B A47B123,000$0.28$1.1Go check it out
ChatGLM

THUDM

The GLM series of models from Tsinghua University feature advanced Chinese language understanding and generation capabilities.

Model NameContextInput (/Mt)Cache read (/Mt)Output (/Mt)Operation
GLM-5.1204,800$1.38$0.26$4.4Go check it out
GLM-5V-Turbo204,800$1.2$0.24$4Go check it out
GLM-5-Turbo202,800$1.2$0.24$4Go check it out
GLM-5204,800$1$0.2$3.2Go check it out
GLM-OCR32,000$0.03-$0.03Go check it out
GLM-4.7-Flash200,000$0.07$0.01$0.4Go check it out
GLM-4.7204,800$0.6-$2.2Go check it out
GLM 4.5V65,536$0.6-$1.8Go check it out
GLM-4.5131,072$0.6-$2.2Go check it out
Sao10K logo

Sao10K

A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.

Model NameContextInput (/Mt)Output (/Mt)Operation
L3 8B Stheno V3.28,192$0.05$0.05Go check it out
Sao10k L3 8B Lunaris 8,192$0.05$0.05Go check it out
L31 70B Euryale V2.28,192$1.48$1.48Go check it out
L3 70B Euryale V2.1 8,192$1.48$1.48Go check it out
Mistralai logo

Mistralai

A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.

Model NameContextInput (/Mt)Output (/Mt)Operation
Mistral Nemo60,288$0.04$0.17Go check it out
Mistral 7B Instruct32,768$0.029$0.059Go check it out
Deepseek logo

Deepseek

Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.

Model NameContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Operation
Deepseek V4 Flash1,048,576$0.14-$0.028$0.28Go check it out
Deepseek V4 Pro1,048,576$1.74-$0.145$3.48Go check it out
DeepSeek-OCR 28,192$0.03--$0.03Go check it out
DeepSeek V3.1163,840$0.27--$1Go check it out
DeepSeek R1 0528163,840$0.7-$0.35$2.5Go check it out
DeepSeek V3 0324163,840$0.28$0.14 (5m)$0.14$1.14Go check it out
MiniMax logo

MiniMax

MiniMax AI's advanced language model delivers powerful conversational AI capabilities, excelling in customer service, content generation, and creative applications, with robust multilingual support and enterprise-grade scalability.

Model NameContextInput (/Mt)Output (/Mt)Operation
MiniMax M11,000,000$0.55$2.2Go check it out
Gryphe logo

Gryphe

An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.

Model NameContextInput (/Mt)Output (/Mt)Operation
Mythomax L2 13B4,096$0.09$0.09Go check it out

Mixture of Experts

A sophisticated collection of state-of-the-art AI models, featuring advanced reasoning and mathematical proof capabilities, as well as cutting-edge language understanding across multiple domains.

Model NameInput Token RangeContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Actions
Qwen3.5-Plus1-256,0001,000,000$0.4--$2.4Go check it out
256,000-1,000,0001,000,000$1.2--$7.2Go check it out
GLM-5.1-204,800$1.38-$0.26$4.4Go check it out
XiaomiMiMo/MiMo-V2.5-Pro1-262,1441,048,576$1-$0.2$3Go check it out
262,144-1,048,5761,048,576$2-$0.4$6Go check it out
OpenAI: GPT OSS 20B-131,072$0.05--$0.2Go check it out
OpenAI GPT OSS 120B-131,072$0.1--$0.5Go check it out
Deepseek V4 Flash-1,048,576$0.14-$0.028$0.28Go check it out
Deepseek V4 Pro-1,048,576$1.74-$0.145$3.48Go check it out
DeepSeek V3.1-163,840$0.27--$1Go check it out
DeepSeek R1 0528-163,840$0.7-$0.35$2.5Go check it out
DeepSeek V3 0324-163,840$0.28$0.14 (5m)$0.14$1.14Go check it out
MiniMax M2.7-highspeed-204,800$0.6-$0.06$2.4Go check it out
MiniMax M2.7-204,800$0.3-$0.03$1.2Go check it out
MiniMax M2.5-highspeed-204,800$0.6-$0.03$2.4Go check it out
MiniMax M2.5-204,800$0.3-$0.03$1.2Go check it out
Minimax M2.1-204,800$0.3$0.375 (5m)$0.03$1.2Go check it out
MiniMax M1-1,000,000$0.55--$2.2Go check it out
GLM-5V-Turbo-204,800$1.2-$0.24$4Go check it out
GLM-5-Turbo-202,800$1.2-$0.24$4Go check it out
GLM-5-204,800$1-$0.2$3.2Go check it out
GLM-4.7-Flash-200,000$0.07-$0.01$0.4Go check it out
GLM-4.7-204,800$0.6--$2.2Go check it out
GLM 4.5V-65,536$0.6--$1.8Go check it out
GLM-4.5-131,072$0.6--$2.2Go check it out
Kimi K2.5-262,144$0.6-$0.1$3Go check it out
Kimi K2 Instruct-131,072$0.57--$2.3Go check it out
Qwen3.5-122B-A10B-262,144$0.4--$3.2Go check it out
Qwen3.5-35B-A3B-262,144$0.25--$2Go check it out
Qwen3.5-397B-A17B-262,144$0.6--$3.6Go check it out
Qwen3 235B A22b Thinking 2507-131,072$0.3--$3Go check it out
Qwen3 30B A3B-40,960$0.09--$0.45Go check it out
Qwen3 32B-40,960$0.1--$0.45Go check it out
Qwen3 235B A22B-40,960$0.2--$0.8Go check it out
ERNIE 4.5 VL 424B A47B-123,000$0.42--$1.25Go check it out
ERNIE 4.5 300B A47B-123,000$0.28--$1.1Go check it out
Llama 4 Maverick Instructions-1,048,576$0.17--$0.85Go check it out
Llama 4 Scout Instructor-131,072$0.1--$0.5Go check it out
Contact Us