定价 - 控制台

新增国内直连 Base URL： https://api.highwayapi.ai/openai，原域名继续提供服务，详见产品文档

Anthropic

Anthropic的Claude模型提供先进的安全AI能力，专注于有用、无害、诚实的AI助手体验，并具备强大的推理和对话能力。

Model Name	Input Token Range	Context	Input (/Mt)	Cache Write (/Mt)	Cache read (/Mt)	Output (/Mt)	Actions
claude-opus-4-8	-	1,000,000	$5	$6.25 (5 min) · $10 (1 hr)	$0.5	$25	Go check it out
claude-opus-4-8-r	-	1,000,000	$1$5	$1.25(5m)·$2(1h)$6.25(5m)·$10(1h)	$0.1$0.5	$5$25	Go check it out
claude-opus-4-7	-	1,000,000	$4.75 $5	$5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr)	$0.475 $0.5	$23.75 $25	Go check it out
claude-opus-4-7-r	-	1,000,000	$1$5	$1.25(5m)·$2(1h)$6.25(5m)·$10(1h)	$0.1$0.5	$5$25	Go check it out
Claude, Op. 4, No. 6	1–200,000	1,000,000	$5	$6.25 (5 min) · $10 (1 hr)	$0.5	$25	Go check it out
Claude, Op. 4, No. 6	200,000–1,000,000	1,000,000	$5	$6.25 (5 min) · $10 (1 hr)	$0.5	$25	Go check it out
claude-opus-4-6-r	-	1,000,000	$1$5	$1.25(5m)·$2(1h)$6.25(5m)·$10(1h)	$0.1$0.5	$5$25	Go check it out
claude-opus-4-6-dd	-	1,000,000	$2.75$5	$3.4375(5m)·$5.5(1h)$6.25(5m)·$10(1h)	$0.275$0.5	$13.75$25	Go check it out
claude-sonnet-4-6	1–200,000	1,000,000	$3	$3.75 (5 min) · $6 (1 hr)	$0.3	$15	Go check it out
claude-sonnet-4-6	200,000–1,000,000	1,000,000	$3	$3.75 (5 min) · $6 (1 hr)	$0.3	$15	Go check it out
claude-sonnet-4-6-r	-	1,000,000	$0.6$3	$0.75(5m)·$1.2(1h)$3.75(5m)·$6(1h)	$0.06$0.3	$3$15	Go check it out
claude-sonnet-4-6-dd	-	1,000,000	$1.65$3	$2.0625(5m)·$3.3(1h)$3.75(5m)·$6(1h)	$0.165$0.3	$8.25$15	Go check it out
claude-opus-4-5-20251101	-	200,000	$4.75 $5	$5.9375 (5 min) · $9.50 (1 hr) $6.25 (5 min) × $10 (1 hr)	$0.475 $0.5	$23.75 $25	Go check it out
claude-opus-4-5-20251101-dd	-	200,000	$2.75$5	$3.4375(5m)$6.25(5m)	$0.275$0.5	$13.75$25	Go check it out
claude-sonnet-4-5-20250929	1–200,000	200,000	$3	$3.75 (5 min) · $6 (1 hr)	$0.3	$15	Go check it out
claude-sonnet-4-5-20250929	200,000–1,000,000	200,000	$6	$7.5 (5 min) × $12 (1 hr)	$0.6	$22.5	Go check it out
claude-sonnet-4-5-20250929-dd	-	200,000	$1.65$3	$2.0625(5m)$3.75(5m)	$0.165$0.3	$8.25$15	Go check it out
claude-haiku-4-5-20251001	-	20,000	$1	$1.25 (5 min) · $2 (1 hr)	$0.1	$5	Go check it out
claude-haiku-4-5-20251001-r	-	200,000	$0.2$1	$0.25(5m)·$0.4(1h)$1.25(5m)·$2(1h)	$0.02$0.1	$1$5	Go check it out
claude-haiku-4-5-20251001-dd	-	200,000	$0.55$1	$0.6875(5m)·$1.1(1h)$1.25(5m)·$2(1h)	$0.055$0.1	$2.75$5	Go check it out
claude-sonnet-4-20250514	-	200,000	$2.85 $3	$3.5625 (5m) $3.75 (5-month)	$0.285 $0.3	$14.25 $15	Go check it out

OpenAI

OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.

Model Name	Input Token Range	Context	Input (/Mt)	Cache read (/Mt)	Output (/Mt)	Actions
gpt-5.5	1–272,000	1,050,000	$5	$0.5	$30	Go check it out
gpt-5.5	272,000–1,050,000	1,050,000	$10	$1	$45	Go check it out
gpt-5.5-pro	1–272,000	1,050,000	$30	-	$180	Go check it out
gpt-5.5-pro	272,000–1,050,000	1,050,000	$60	-	$270	Go check it out
gpt-5.5-r	-	1,050,000	$0.25$5	$0.025$0.5	$1.5$30	Go check it out
gpt-5.5-light	1–272,000	1,050,000	$0.25$5	$0.025$0.5	$1.5$30	Go check it out
gpt-5.5-light	272,000–1,050,000	1,050,000	$0.5$10	$0.05$1	$2.25$45	Go check it out
gpt-5.4-nano	-	400,000	$0.19 $0.2	$0.019 $0.02	$1.1875 $1.25	Go check it out
gpt-5.4-mini	-	400,000	$0.7125 $0.75	$0.0712 $0.075	$4.275 $4.5	Go check it out
gpt-5.4-pro	1–272,000	1,050,000	$30	-	$180	Go check it out
gpt-5.4-pro	272,000–1,050,000	1,050,000	$60	-	$270	Go check it out
GPT-5.4	1–272,000	1,050,000	$2.5	$0.25	$15	Go check it out
GPT-5.4	272,000–1,050,000	1,050,000	$5	$0.5	$22.5	Go check it out
gpt-5.3-chat-latest	-	128,000	$1.6625 $1.75	$0.1662 $0.175	$13.3 $14	Go check it out
gpt-5.3-codex	-	400,000	$1.6625 $1.75	$0.1662 $0.175	$13.3 $14	Go check it out
gpt-5.2-codex	-	400,000	$1.75	$0.175	$14	Go check it out
GPT-5.2	-	400,000	$1.6625 $1.75	$0.1662 $0.175	$13.3 $14	Go check it out
gpt-5.2-pro	-	400,000	$19.95 $21	-	$159.6 $168	Go check it out
gpt-5.2-chat-latest	-	128,000	$1.6625 $1.75	$0.1662 $0.175	$13.3 $14	Go check it out
gpt-5.1-codex-max	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5.1-codex-mini	-	400,000	$0.2375 $0.25	$0.0237 $0.025	$1.9 $2	Go check it out
gpt-5.1-codex	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5.1-chat-latest	-	128,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5.1	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5-pro	-	400,000	$14.25 $15	-	$114 $120	Go check it out
gpt-5-codex	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5-chat-latest	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
gpt-5-nano	-	400,000	$0.0475 $0.05	$0.0047 $0.005	$0.38 $0.4	Go check it out
gpt-5-mini	-	400,000	$0.2375 $0.25	$0.0237 $0.025	$1.9 $2	Go check it out
GPT-5	-	400,000	$1.1875 $1.25	$0.1187 $0.125	$9.5 $10	Go check it out
OpenAI: GPT OSS 20B	-	131,072	$0.05	-	$0.2	Go check it out
OpenAI GPT OSS 120B	-	131,072	$0.1	-	$0.5	Go check it out
gpt-4.1-mini	-	1,047,576	$0.4	$0.1	$1.6	Go check it out
gpt-4.1-nano	-	1,047,576	$0.1	$0.025	$0.4	Go check it out
gpt-4.1	-	1,047,576	$2	$0.5	$8	Go check it out
GPT-4o-mini	-	128,000	$0.1425 $0.15	$0.0712 $0.075	$0.57 $0.6	Go check it out
GPT-4O	-	131,072	$2.375 $2.5	$1.1875 $1.25	$9.5 $10	Go check it out

Gemini

Google的Gemini模型提供高质量的语言处理能力，在各种NLP任务中表现出色，并具备强大的多模态能力。

Model Name	Input Token Range	Context	Input (/Mt)	Cache Write (/Mt)	Cache read (/Mt)	Output (/Mt)	Actions
gemini-3.1-pro-preview	1–204,800	1,048,576	$2	$0.375 (5 min) × $4.5 (1 hr)	$0.2	$12	Go check it out
gemini-3.1-pro-preview	204,800–1,048,576	1,048,576	$4	$0.375 (5 min) × $4.5 (1 hr)	$0.4	$18	Go check it out
gemini-3.1-flash-lite-preview	-	1,048,576	$0.2375 $0.25	$0.0791 (5 min) × $0.95 (1 hr) $0.0833 (5 min) × $1 (1 hr)	$0.0237 $0.025	$1.425 $1.5	Go check it out
gemini-3-flash-preview	-	1,048,576	$0.475 $0.5	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0475 $0.05	$2.85 $3	Go check it out
gemini-2.5-flash-lite-preview-09-2025	-	1,048,576	$0.095 $0.1	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0095 $0.01	$0.38 $0.4	Go check it out
gemini-2.5-flash-lite	-	1,048,576	$0.095 $0.1	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0095 $0.01	$0.38 $0.4	Go check it out
gemini-2.5-pro	-	1,048,576	$1.1875 $1.25	$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)	$0.1187 $0.125	$9.5 $10	Go check it out
gemini-2.5-flash	-	1,048,576	$0.285 $0.3	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0285 $0.03	$2.375 $2.5	Go check it out
gemini-2.5-flash-lite-preview-06-17	-	1,048,576	$0.095 $0.1	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0095 $0.01	$0.38 $0.4	Go check it out
gemini-2.5-flash-preview-05-20	-	1,048,576	$0.1425 $0.15	$0.0788 (5 min) × $0.95 (1 hr) $0.083 (5 min) × $1 (1 hr)	$0.0285 $0.03	$3.325 $3.5	Go check it out
gemini-2.5-pro-preview-06-05	-	1,048,576	$1.1875 $1.25	$0.3562 (5-month) × $4.275 (1-hour) $0.375 (5 min) × $4.5 (1 hr)	$0.1187 $0.125	$9.5 $10	Go check it out
gemini-2.0-flash-lite	-	1,048,576	$0.0712 $0.075	-	-	$0.285 $0.3	Go check it out
gemini-2.0-flash-20250609	-	1,048,576	$0.1425 $0.15	-	-	$0.57 $0.6	Go check it out
Gemma3 12B	-	131,072	$0.05	-	-	$0.1	Go check it out
Gemma 3 27B	-	32,768	$0.119	-	-	$0.2	Go check it out
gemini-3.5-flash	-	1,048,576	$1.5	$0.083(5m)·$1(1h)	$0.15	$9	Go check it out

Llama

Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
Llama 4 Maverick Instructions	1,048,576	$0.17	$0.85	Go check it out
Llama 4 Scout Instructor	131,072	$0.1	$0.5	Go check it out
Llama 3.3 70B Instruct	131,072	$0.13	$0.39	Go check it out
Llama 3.2 3B Instruct	32,768	$0.03	$0.05	Go check it out
Llama 3.1 8B Instruct	16,384	$0.02	$0.05	Go check it out

Qwen

Qwen系列模型提供高效的语言处理能力，具有多种参数规模，涵盖从轻量级到企业级的解决方案。

Model Name	Input Token Range	Context	Input (/Mt)	Output (/Mt)	Actions
Qwen3.5-Plus	1-256,000	1,000,000	$0.4	$2.4	Go check it out
Qwen3.5-Plus	256,000-1,000,000	1,000,000	$1.2	$7.2	Go check it out
Qwen3.5-27B	-	262,144	$0.3	$2.4	Go check it out
Qwen3.5-122B-A10B	-	262,144	$0.4	$3.2	Go check it out
Qwen3.5-35B-A3B	-	262,144	$0.25	$2	Go check it out
Qwen3.5-397B-A17B	-	262,144	$0.6	$3.6	Go check it out
Qwen3 Coder Next FP8	-	262,144	$0.2	$1.5	Go check it out
Qwen3 Next 80B A3B Instruct	-	65,536	$0.15	$1.5	Go check it out
Qwen3 Next 80B A3B Thinking	-	65,536	$0.15	$1.5	Go check it out
Qwen MT Plus	-	4,096	$0.25	$0.75	Go check it out
Qwen3 235B A22b Thinking 2507	-	131,072	$0.3	$3	Go check it out
Qwen3 Coder 480B A35B Instructions	-	262,144	$0.29	$1.2	Go check it out
Qwen3 235B A22B Instruct 2507	-	131,072	$0.15	$0.8	Go check it out
Qwen3 30B A3B	-	40,960	$0.09	$0.45	Go check it out
Qwen3 32B	-	40,960	$0.1	$0.45	Go check it out
Qwen3 235B A22B	-	40,960	$0.2	$0.8	Go check it out
Qwen 2.5 7B Instruct	-	32,000	$0.07	$0.07	Go check it out
Qwen 2.5 VL 72B Instruction Manual	-	32,768	$0.8	$0.8	Go check it out
Qwen 2.5 72B Instruct	-	32,000	$0.38	$0.4	Go check it out

Baidu

百度的ERNIE模型提供先进的中文语言理解和多模态能力，针对中文应用进行了优化，并具备具有竞争力的价格。

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
ERNIE 4.5 VL 424B A47B	123,000	$0.42	$1.25	Go check it out
ERNIE 4.5 300B A47B	123,000	$0.28	$1.1	Go check it out

THUDM

来自清华大学的GLM系列模型，具备先进的中文语言理解和生成能力。

Model Name	Context	Input (/Mt)	Cache read (/Mt)	Output (/Mt)	Operation
GLM-5.1	204,800	$1.38	$0.26	$4.4	Go check it out
GLM-5V-Turbo	204,800	$1.2	$0.24	$4	Go check it out
GLM-5-Turbo	202,800	$1.2	$0.24	$4	Go check it out
GLM-5	204,800	$1	$0.2	$3.2	Go check it out
GLM-OCR	32,000	$0.03	-	$0.03	Go check it out
GLM-4.7-Flash	200,000	$0.07	$0.01	$0.4	Go check it out
GLM-4.7	204,800	$0.6	-	$2.2	Go check it out
GLM 4.5V	65,536	$0.6	-	$1.8	Go check it out
GLM-4.5	131,072	$0.6	-	$2.2	Go check it out

Sao10K

A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
L3 8B Stheno V3.2	8,192	$0.05	$0.05	Go check it out
Sao10k L3 8B Lunaris	8,192	$0.05	$0.05	Go check it out
L31 70B Euryale V2.2	8,192	$1.48	$1.48	Go check it out
L3 70B Euryale V2.1	8,192	$1.48	$1.48	Go check it out

Mistralai

A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
Mistral Nemo	60,288	$0.04	$0.17	Go check it out
Mistral 7B Instruct	32,768	$0.029	$0.059	Go check it out

Deepseek

Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.

Model Name	Context	Input (/Mt)	Cache Write (/Mt)	Cache read (/Mt)	Output (/Mt)	Operation
Deepseek V4 Flash	1,048,576	$0.14	-	$0.028	$0.28	Go check it out
Deepseek V4 Pro	1,048,576	$1.74	-	$0.145	$3.48	Go check it out
DeepSeek-OCR 2	8,192	$0.03	-	-	$0.03	Go check it out
DeepSeek V3.1	163,840	$0.27	-	-	$1	Go check it out
DeepSeek R1 0528	163,840	$0.7	-	$0.35	$2.5	Go check it out
DeepSeek V3 0324	163,840	$0.28	$0.14 (5m)	$0.14	$1.14	Go check it out

MiniMax

MiniMax AI的先进语言模型提供强大的对话AI能力，在客户服务、内容生成和创意应用中表现优异，并具备强大的多语言支持和企业级可扩展性。

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
MiniMax M1	1,000,000	$0.55	$2.2	Go check it out

Gryphe

An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.

Model Name	Context	Input (/Mt)	Output (/Mt)	Operation
Mythomax L2 13B	4,096	$0.09	$0.09	Go check it out

Mixture of Experts

最先进AI模型的高级集合，具备高级推理、数学证明能力以及跨多个领域的前沿语言理解能力。

Model Name	Input Token Range	Context	Input (/Mt)	Cache Write (/Mt)	Cache read (/Mt)	Output (/Mt)	Actions
Qwen3.5-Plus	1-256,000	1,000,000	$0.4	-	-	$2.4	Go check it out
Qwen3.5-Plus	256,000-1,000,000	1,000,000	$1.2	-	-	$7.2	Go check it out
GLM-5.1	-	204,800	$1.38	-	$0.26	$4.4	Go check it out
XiaomiMiMo/MiMo-V2.5-Pro	1-262,144	1,048,576	$1	-	$0.2	$3	Go check it out
XiaomiMiMo/MiMo-V2.5-Pro	262,144-1,048,576	1,048,576	$2	-	$0.4	$6	Go check it out
OpenAI: GPT OSS 20B	-	131,072	$0.05	-	-	$0.2	Go check it out
OpenAI GPT OSS 120B	-	131,072	$0.1	-	-	$0.5	Go check it out
Deepseek V4 Flash	-	1,048,576	$0.14	-	$0.028	$0.28	Go check it out
Deepseek V4 Pro	-	1,048,576	$1.74	-	$0.145	$3.48	Go check it out
DeepSeek V3.1	-	163,840	$0.27	-	-	$1	Go check it out
DeepSeek R1 0528	-	163,840	$0.7	-	$0.35	$2.5	Go check it out
DeepSeek V3 0324	-	163,840	$0.28	$0.14 (5m)	$0.14	$1.14	Go check it out
MiniMax M2.7-highspeed	-	204,800	$0.6	-	$0.06	$2.4	Go check it out
MiniMax M2.7	-	204,800	$0.3	-	$0.03	$1.2	Go check it out
MiniMax M2.5-highspeed	-	204,800	$0.6	-	$0.03	$2.4	Go check it out
MiniMax M2.5	-	204,800	$0.3	-	$0.03	$1.2	Go check it out
Minimax M2.1	-	204,800	$0.3	$0.375 (5m)	$0.03	$1.2	Go check it out
MiniMax M1	-	1,000,000	$0.55	-	-	$2.2	Go check it out
GLM-5V-Turbo	-	204,800	$1.2	-	$0.24	$4	Go check it out
GLM-5-Turbo	-	202,800	$1.2	-	$0.24	$4	Go check it out
GLM-5	-	204,800	$1	-	$0.2	$3.2	Go check it out
GLM-4.7-Flash	-	200,000	$0.07	-	$0.01	$0.4	Go check it out
GLM-4.7	-	204,800	$0.6	-	-	$2.2	Go check it out
GLM 4.5V	-	65,536	$0.6	-	-	$1.8	Go check it out
GLM-4.5	-	131,072	$0.6	-	-	$2.2	Go check it out
Kimi K2.5	-	262,144	$0.6	-	$0.1	$3	Go check it out
Kimi K2 Instruct	-	131,072	$0.57	-	-	$2.3	Go check it out
Qwen3.5-122B-A10B	-	262,144	$0.4	-	-	$3.2	Go check it out
Qwen3.5-35B-A3B	-	262,144	$0.25	-	-	$2	Go check it out
Qwen3.5-397B-A17B	-	262,144	$0.6	-	-	$3.6	Go check it out
Qwen3 235B A22b Thinking 2507	-	131,072	$0.3	-	-	$3	Go check it out
Qwen3 30B A3B	-	40,960	$0.09	-	-	$0.45	Go check it out
Qwen3 32B	-	40,960	$0.1	-	-	$0.45	Go check it out
Qwen3 235B A22B	-	40,960	$0.2	-	-	$0.8	Go check it out
ERNIE 4.5 VL 424B A47B	-	123,000	$0.42	-	-	$1.25	Go check it out
ERNIE 4.5 300B A47B	-	123,000	$0.28	-	-	$1.1	Go check it out
Llama 4 Maverick Instructions	-	1,048,576	$0.17	-	-	$0.85	Go check it out
Llama 4 Scout Instructor	-	131,072	$0.1	-	-	$0.5	Go check it out