Large Model API Pricing

Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.

Anthropic logo

Anthropic

Anthropic's Claude model offers advanced AI safety capabilities, focusing on useful, harmless, and honest AI assistants with powerful reasoning and conversational abilities.

Model NameInput Token RangeContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Actions
claude-sonnet-4-61–200,0001,000,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
200,000–1,000,0001,000,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
Claude, Op. 4, No. 61–200,0001,000,000$5$6.25 (5 min) · $10 (1 hr)$0.5$25Go check it out
200,000–1,000,0001,000,000$5$6.25 (5 min) · $10 (1 hr)$0.5$25Go check it out
claude-opus-4-5-20251101-200,000
$4.75 $5
$5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr)
$0.475 $0.5
$23.75 $25
Go check it out
claude-haiku-4-5-20251001-20,000$1$1.25 (5 min) · $2 (1 hr)$0.1$5Go check it out
claude-sonnet-4-5-202509291–200,000200,000$3$3.75 (5 min) · $6 (1 hr)$0.3$15Go check it out
200,000–1,000,000200,000$6$7.5 (5 min) × $12 (1 hr)$0.6$22.5Go check it out
claude-3-7-sonnet-20250219-200,000
$2.85 $3
$3.5625 (5m) $3.75 (5-month)
$0.285 $0.3
$14.25 $15
Go check it out
claude-sonnet-4-20250514-200,000
$2.85 $3
$3.5625 (5m) $3.75 (5-month)
$0.285 $0.3
$14.25 $15
Go check it out
claude-opus-4-20250514-200,000
$14.25 $15
$17.8125 (5m) $18.75 (5m)
$1.425 $1.5
$71.25 $75
Go check it out
claude-3-5-sonnet-20241022-200,000
$2.85 $3
$3.5625 (5m) $3.75 (5-month)
$0.285 $0.3
$14.25 $15
Go check it out
claude-3-haiku-20240307-200,000
$0.2375 $0.25
--
$1.1875 $1.25
Go check it out
claude-3-5-haiku-20241022-200,000
$0.76 $0.8
--
$3.8 $4
Go check it out
OpenAI

OpenAI

OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.

Model NameInput Token RangeContextInput (/Mt)Cache read (/Mt)Output (/Mt)Actions
gpt-5.4-pro1–272,0001,050,000$30-$180Go check it out
272,000–1,050,0001,050,000$60-$270Go check it out
gpt-5.4-nano-400,000
$0.19 $0.2
$0.019 $0.02
$1.1875 $1.25
Go check it out
gpt-5.4-mini-400,000
$0.7125 $0.75
$0.0712 $0.075
$4.275 $4.5
Go check it out
GPT-5.41–272,0001,050,000$2.5$0.25$15Go check it out
272,000–1,050,0001,050,000$5$0.5$22.5Go check it out
gpt-5.3-codex-400,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.3-chat-latest-128,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
GPT-5.2-400,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.1-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
openai/gpt-oss-120b-131,072$0.1-$0.5Go check it out
gpt-5-codex-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
openai/gpt-oss-20b-131,072$0.05-$0.2Go check it out
gpt-5.1-chat-latest-128,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
GPT-5-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5-mini-400,000
$0.2375 $0.25
$0.0237 $0.025
$1.9 $2
Go check it out
gpt-5-nano-400,000
$0.0475 $0.05
$0.0047 $0.005
$0.38 $0.4
Go check it out
gpt-5-pro-400,000
$14.25 $15
-
$114 $120
Go check it out
gpt-5.2-codex-400,000$1.75$0.175$14Go check it out
gpt-5.2-pro-400,000
$19.95 $21
-
$159.6 $168
Go check it out
gpt-5.2-chat-latest-128,000
$1.6625 $1.75
$0.1662 $0.175
$13.3 $14
Go check it out
gpt-5.1-codex-max-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5.1-codex-mini-400,000
$0.2375 $0.25
$0.0237 $0.025
$1.9 $2
Go check it out
gpt-5.1-codex-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-5-chat-latest-400,000
$1.1875 $1.25
$0.1187 $0.125
$9.5 $10
Go check it out
gpt-4.1-mini-1,047,576$0.4$0.1$1.6Go check it out
gpt-4.1-nano-1,047,576$0.1$0.025$0.4Go check it out
gpt-4.1-1,047,576$2$0.5$8Go check it out
GPT-4o-mini-128,000
$0.1425 $0.15
$0.0712 $0.075
$0.57 $0.6
Go check it out
GPT-4O-131,072
$2.375 $2.5
$1.1875 $1.25
$9.5 $10
Go check it out
Gemini logo

Gemini

Google's Gemini model offers high-quality natural language processing capabilities, performs exceptionally well across a wide range of NLP tasks, and boasts powerful multimodal capabilities.

Model NameInput Token RangeContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Actions
gemini-3.1-flash-lite-preview-1,048,576
$0.2375 $0.25
$0.0791 (5 min) × $0.95 (1 hr) $0.0833 (5 min) × $1 (1 hr)
$0.0237 $0.025
$1.425 $1.5
Go check it out
gemini-3.1-pro-preview1–204,8001,048,576$2$0.375 (5 min) × $4.5 (1 hr)$0.2$12Go check it out
204,800–1,048,5761,048,576$4$0.375 (5 min) × $4.5 (1 hr)$0.4$18Go check it out
google/gemma-3-12b-it-131,072$0.05--$0.1Go check it out
gemini-2.5-flash-1,048,576
$0.285 $0.3
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0285 $0.03
$2.375 $2.5
Go check it out
gemini-3-flash-preview-1,048,576
$0.475 $0.5
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0475 $0.05
$2.85 $3
Go check it out
gemini-3-pro-preview1–204,8001,048,576$2-$0.2$12Go check it out
204,800–1,048,5761,048,576$4-$0.4$18Go check it out
gemini-2.5-flash-lite-preview-09-2025-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.0-flash-lite-1,048,576
$0.0712 $0.075
--
$0.285 $0.3
Go check it out
gemini-2.5-flash-lite-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.5-pro-1,048,576
$1.1875 $1.25
$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)
$0.1187 $0.125
$9.5 $10
Go check it out
gemini-2.5-flash-lite-preview-06-17-1,048,576
$0.095 $0.1
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0095 $0.01
$0.38 $0.4
Go check it out
gemini-2.5-flash-preview-05-20-1,048,576
$0.1425 $0.15
$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)
$0.0285 $0.03
$3.325 $3.5
Go check it out
gemini-2.5-pro-preview-06-05-1,048,576
$1.1875 $1.25
$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)
$0.1187 $0.125
$9.5 $10
Go check it out
gemini-2.0-flash-20250609-1,048,576
$0.1425 $0.15
--
$0.57 $0.6
Go check it out
google/gemma-3-27b-it-32,768$0.119--$0.2Go check it out
Llama logo

Llama

Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.

Qwen logo

Qwen

The Qwen series of models offers powerful natural language processing capabilities and is available in a range of parameter sizes, from lightweight to enterprise-grade solutions.

Wenxin

Baidu

Baidu's ERNIE model offers advanced Chinese language understanding and multimodal capabilities, is optimized for Chinese applications, and is competitively priced.

Model NameContextInput (/Mt)Output (/Mt)Operation
baidu/ernie-4.5-vl-424b-a47b123,000$0.42$1.25Go check it out
baidu/ernie-4.5-300b-a47b-paddle123,000$0.28$1.1Go check it out
ChatGLM

THUDM

The GLM series of models from Tsinghua University feature advanced Chinese language understanding and generation capabilities.

Model NameContextInput (/Mt)Cache read (/Mt)Output (/Mt)Operation
zai-org/glm-4.5v65,536$0.6-$1.8Go check it out
zai-org/glm-4.5131,072$0.6-$2.2Go check it out
zai-org/glm-5204,800$1$0.2$3.2Go check it out
zai-org/glm-ocr32,000$0.03-$0.03Go check it out
zai-org/glm-4.7-flash200,000$0.07$0.01$0.4Go check it out
zai-org/glm-4.7204,800$0.6-$2.2Go check it out
Sao10K logo

Sao10K

A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.

Model NameContextInput (/Mt)Output (/Mt)Operation
Sao10K/L3-8B-Stheno-v3.28,192$0.05$0.05Go check it out
sao10k/l3-8b-lunaris8,192$0.05$0.05Go check it out
sao10k/l31-70b-euryale-v2.28,192$1.48$1.48Go check it out
sao10k/l3-70b-euryale-v2.18,192$1.48$1.48Go check it out
Mistralai logo

Mistralai

A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.

Model NameContextInput (/Mt)Output (/Mt)Operation
mistralai/mistral-nemo60,288$0.04$0.17Go check it out
mistralai/mistral-7b-instruct32,768$0.029$0.059Go check it out
Deepseek logo

Deepseek

Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.

Model NameContextInput (/Mt)Cache Write (/Mt)Cache read (/Mt)Output (/Mt)Operation
deepseek/deepseek-v3.1163,840$0.27--$1Go check it out
deepseek/deepseek-ocr-28,192$0.03--$0.03Go check it out
deepseek/deepseek-r1-0528163,840$0.7-$0.35$2.5Go check it out
deepseek/deepseek-v3-0324163,840$0.28$0.14 (5m)$0.14$1.14Go check it out
MiniMax logo

MiniMax

MiniMax AI's advanced language model delivers powerful conversational AI capabilities, excelling in customer service, content generation, and creative applications, with robust multilingual support and enterprise-grade scalability.

Model NameContextInput (/Mt)Output (/Mt)Operation
minimaxai/minimax-m1-80k1,000,000$0.55$2.2Go check it out
Gryphe logo

Gryphe

An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.

Model NameContextInput (/Mt)Output (/Mt)Operation
gryphe/mythomax-l2-13b4,096$0.09$0.09Go check it out

Mixture of Experts

A sophisticated collection of state-of-the-art AI models, featuring advanced reasoning and mathematical proof capabilities, as well as cutting-edge language understanding across multiple domains.

Contact Us