Qwen3 Coder Next FP8

qwen/qwen3-coder-next
Qwen3-Coder-Next is an open-weight language model designed for coding agents and local development environments. It activates only 3 billion of its 80 billion total parameters, yet achieves results comparable to models with 10 to 20 times more active parameters, which keeps agent deployment cost-effective. Its training targets agentic capabilities: long-horizon reasoning, complex tool usage, and recovery from execution failures across dynamic coding tasks. The model supports a 256k context length and adapts to various scaffold templates, so it integrates with CLI/IDE platforms such as Claude Code, Qwen Code, Qoder, Kilo, Trae, and Cline.
Price
Input: $0.2 per million tokens
Output: $1.5 per million tokens
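
As a rough illustration of how the listed prices translate into a per-request cost, the sketch below multiplies token counts by the per-million rates. The function name `estimate_cost_usd` and the example token counts are made up for illustration, and actual billing may differ (rounding, minimum charges, or other rules are not described on this page).

```python
# Listed prices in USD per one million tokens (from the Price section).
INPUT_USD_PER_MILLION = 0.2
OUTPUT_USD_PER_MILLION = 1.5


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example: a 100,000-token prompt that produces a 10,000-token reply.
cost = estimate_cost_usd(input_tokens=100_000, output_tokens=10_000)
print(f"${cost:.4f}")  # prints $0.0350
```

Output tokens cost 7.5 times as much as input tokens at these rates, so long generations dominate the bill even when prompts are large.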

Use the following code example to integrate our API:

from openai import OpenAI

# Point the OpenAI SDK at the OpenAI-compatible endpoint.
client = OpenAI(
    api_key="<Your API Key>",
    base_url="https://api.jiekou.ai/openai"
)

# Send a chat completion request to the model.
response = client.chat.completions.create(
    model="qwen/qwen3-coder-next",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"}
    ],
    max_tokens=65536,
    temperature=0.7
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)
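
The example above requests `max_tokens=65536`, which matches the listed maximum output, while the listed context length is 262144 tokens. The sketch below shows one way to pick a safe `max_tokens` value for a long prompt. It assumes that prompt and output tokens share the same context window, which is a common convention but is not stated on this page, and `clamp_max_tokens` is a hypothetical helper, not part of any SDK.

```python
CONTEXT_LENGTH = 262_144  # listed context length, in tokens
MAX_OUTPUT = 65_536       # listed maximum output, in tokens


def clamp_max_tokens(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return a max_tokens value that fits the remaining context.

    Assumption: the prompt and the generated output share one context window.
    """
    if prompt_tokens < 0 or requested < 1:
        raise ValueError("prompt_tokens must be >= 0 and requested >= 1")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not leave room for any output")
    # Never exceed the request, the output cap, or the remaining window.
    return min(requested, MAX_OUTPUT, remaining)


print(clamp_max_tokens(1_000))    # prints 65536
print(clamp_max_tokens(250_000))  # prints 12144
```

The returned value can be passed as `max_tokens` in the request. Counting prompt tokens requires the model's tokenizer, which this sketch leaves out.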

Information

Provider
Quantization: fp8

Supported Features

Context length: 262144 tokens
Maximum output: 65536 tokens
Function call: Supported
Structured output: Supported
Serverless: Supported
Input Capabilities: text
Output Capabilities: text
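
Since function calling is listed as supported, a request with a tool definition might look like the sketch below. It assumes the endpoint accepts the standard OpenAI-style `tools` parameter, which this page does not spell out. The `read_file` tool and the helper names `build_tool_request` and `send` are hypothetical, chosen only for illustration.

```python
import json


def build_tool_request(user_prompt: str) -> dict:
    """Build chat-completion arguments that declare one tool (illustrative)."""
    return {
        "model": "qwen/qwen3-coder-next",
        "messages": [{"role": "user", "content": user_prompt}],
        "tools": [{
            "type": "function",
            "function": {
                "name": "read_file",  # hypothetical tool name
                "description": "Read a text file from the workspace.",
                "parameters": {  # JSON Schema describing the arguments
                    "type": "object",
                    "properties": {
                        "path": {"type": "string",
                                 "description": "Relative file path."},
                    },
                    "required": ["path"],
                },
            },
        }],
        "max_tokens": 1024,
    }


def send(request: dict):
    """Send the request; needs the openai package and a valid API key."""
    from openai import OpenAI  # lazy import: the payload builds without it
    client = OpenAI(api_key="<Your API Key>",
                    base_url="https://api.jiekou.ai/openai")
    message = client.chat.completions.create(**request).choices[0].message
    # Tool-call arguments arrive as a JSON string and must be decoded.
    for call in message.tool_calls or []:
        print(call.function.name, json.loads(call.function.arguments))
    return message


request = build_tool_request("Show me the contents of README.md")
print(json.dumps(request["tools"][0]["function"], indent=2))
```

Calling `send(request)` performs the network call. If the model decides to use the tool, the application runs it and returns the result in a follow-up message with role `tool`.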