Qwen3 Coder Next FP8

qwen/qwen3-coder-next

Qwen3-Coder-Next is an open-weight language model specifically designed for coding agents and local development environments. This highly efficient model delivers exceptional performance with only 3 billion active parameters out of a total of 80 billion, achieving results comparable to models with 10 to 20 times more active parameters while maintaining remarkable cost-effectiveness for agent deployment. Through its sophisticated training methodology, Qwen3-Coder-Next excels in advanced agentic capabilities, including long-horizon reasoning, complex tool usage, and robust recovery from execution failures, ensuring reliable performance across dynamic coding tasks. The model’s versatility is further enhanced by its 256k context length and adaptability to various scaffold templates, enabling seamless integration with diverse CLI/IDE platforms such as Claude Code, Qwen Code, Qoder, Kilo, Trae, and Cline, making it an ideal solution for comprehensive development environments.

Price

Input	$0.2 per million tokens
Output	$1.5 per million tokens

Use the following code example to integrate our API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.highwayapi.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="qwen/qwen3-coder-next",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=65536,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Information

Provider

Alibaba

Quantification

fp8

Supported Features

Context length

262144

Maximum output

65536

Function call

Support

Structured output

Support

serverless

Support

Input Capabilities

text

Output Capabilities

text