moonshotai/kimi-k2-0905

Kimi K2 0905

Kimi K2 0905 is the September update to Kimi K2 0711. It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, with 1 trillion total parameters, of which 32 billion are active during each forward pass. It supports long-context inference of up to 256k tokens, up from 128k in the previous release.

This update improves agentic coding, with higher accuracy and better generalization across scaffolds, and improves frontend coding, with more aesthetically pleasing and functional outputs for web, 3D, and related tasks. Kimi K2 is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) benchmarks. The model is trained with a novel stack that incorporates the MuonClip optimizer for stable, large-scale MoE training.
Price
Input: $0.6 per million tokens
Output: $2.5 per million tokens
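
As a rough illustration of how the per-token prices translate into a per-request cost, here is a minimal sketch. The two prices come from the listing above; the helper name `estimate_cost_usd` and the example token counts are hypothetical, and actual billing may round or meter differently.

```python
# Prices from the listing above, in USD per million tokens.
INPUT_PRICE_PER_M = 0.6
OUTPUT_PRICE_PER_M = 2.5


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 10,000-token prompt with a 2,000-token completion.
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0110
```

Output tokens cost roughly four times as much as input tokens, so long completions tend to dominate the bill.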

Use the following code example to integrate our API:

from openai import OpenAI

# Point the OpenAI SDK at the OpenAI-compatible endpoint.
client = OpenAI(
    api_key="<Your API Key>",
    base_url="https://api.jiekou.ai/openai"
)

response = client.chat.completions.create(
    model="moonshotai/kimi-k2-0905",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"}
    ],
    max_tokens=1024,  # completion cap; the listed maximum is 262144, which equals the full context length
    temperature=0.7
)

# Print the text of the first (and only) completion.
print(response.choices[0].message.content)
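
The `max_tokens` argument in the example above caps the completion length. The sketch below shows one way to pick a value that fits, assuming the prompt and the completion share the same 262144-token context window (a common convention, but not something this page states). The helper name `clamp_max_tokens` is hypothetical, and the prompt token count is supplied by the caller, for example from a tokenizer or from the usage figures of an earlier response.

```python
CONTEXT_LENGTH = 262_144  # context length listed for this model
MAX_OUTPUT = 262_144      # maximum output listed for this model


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return the largest completion budget that fits beside the prompt.

    Assumes prompt and completion tokens count against one shared window.
    """
    if prompt_tokens < 0 or requested < 1:
        raise ValueError("prompt_tokens must be >= 0 and requested must be >= 1")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining < 1:
        raise ValueError("prompt already fills the context window")
    return min(requested, MAX_OUTPUT, remaining)


# A 1,000-token prompt leaves 261,144 tokens for the completion.
print(clamp_max_tokens(1_000, 262_144))  # prints 261144
```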

Information

Provider
Quantization: fp8

Supported Features

Context length: 262144 tokens
Maximum output: 262144 tokens
Function call: Supported
Structured output: Supported
Serverless: Supported
Input capabilities: text
Output capabilities: text
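
Since function calling is listed as supported, the following sketch shows how a tool might be declared and dispatched using the OpenAI SDK's `tools` parameter. It assumes this endpoint accepts the standard OpenAI tool-calling format, which this page does not spell out. The `get_weather` tool and the helpers `build_request`, `dispatch`, and `run` are hypothetical and exist only for illustration.

```python
import json

MODEL = "moonshotai/kimi-k2-0905"


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"Sunny in {city}"


# Tool declaration in the OpenAI chat-completions format (JSON Schema parameters).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Return a short weather report for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

LOCAL_FUNCTIONS = {"get_weather": get_weather}


def build_request(user_text: str) -> dict:
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": user_text}],
        "tools": TOOLS,
    }


def dispatch(name: str, arguments_json: str) -> str:
    """Run the local function named by a tool call; arguments arrive as a JSON string."""
    args = json.loads(arguments_json)
    return LOCAL_FUNCTIONS[name](**args)


def run(client, user_text: str) -> list[str]:
    """Send the request with an already-configured OpenAI client and run any tool calls."""
    response = client.chat.completions.create(**build_request(user_text))
    tool_calls = response.choices[0].message.tool_calls or []
    return [dispatch(call.function.name, call.function.arguments) for call in tool_calls]
```

Calling `run(client, "What is the weather in Paris?")` with a client configured as in the integration example would return the results of any tool calls the model chose to make. Sending those results back to the model in a follow-up `tool` message is left out to keep the sketch short.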