Kimi K2 0905

moonshotai/kimi-k2-0905

Kimi K2 0905 is the September update to Kimi K2 0711. It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring a total of 1 trillion parameters, with 32 billion active during each forward pass. It supports long-context inference of up to 256k tokens, an increase from the previous 128k. This update improves agentic coding with higher accuracy and better generalization across scaffolds, and enhances frontend coding with more aesthetically pleasing and functional outputs for web, 3D, and related tasks. Kimi K2 is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) benchmarks. The model is trained using a novel stack that incorporates the MuonClip optimizer for stable, large-scale MoE training.

Price

Input	$0.6 per million tokens
Output	$2.5 per million tokens

Use the following code example to integrate our API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.highwayapi.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="moonshotai/kimi-k2-0905",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=100352,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Information

Provider

Moonshot

Quantification

fp8

Supported Features

Context length

262144

Maximum output

100352

Function call

Support

Structured output

Support

serverless

Support

Input Capabilities

text

Output Capabilities

text