Kimi K2 Thinking gguf
One of best thinking models. modified-mit license (see moonshotai/Kimi-K2-Thinking)
Evaluation
Reasoning Tasks
| Benchmark | Setting | K2 Thinking | GPT-5 (High) |
Claude Sonnet 4.5 (Thinking) |
K2 0905 | DeepSeek-V3.2 | Grok-4 |
|---|---|---|---|---|---|---|---|
| HLE (Text-only) | no tools | 23.9 | 26.3 | 19.8* | 7.9 | 19.8 | 25.4 |
| w/ tools | 44.9 | 41.7* | 32.0* | 21.7 | 20.3* | 41.0 | |
| heavy | 51.0 | 42.0 | - | - | - | 50.7 | |
| AIME25 | no tools | 94.5 | 94.6 | 87.0 | 51.0 | 89.3 | 91.7 |
| w/ python | 99.1 | 99.6 | 100.0 | 75.2 | 58.1* | 98.8 | |
| heavy | 100.0 | 100.0 | - | - | - | 100.0 | |
| HMMT25 | no tools | 89.4 | 93.3 | 74.6* | 38.8 | 83.6 | 90.0 |
| w/ python | 95.1 | 96.7 | 88.8* | 70.4 | 49.5* | 93.9 | |
| heavy | 97.5 | 100.0 | - | - | - | 96.7 | |
| IMO-AnswerBench | no tools | 78.6 | 76.0* | 65.9* | 45.8 | 76.0* | 73.1 |
| GPQA | no tools | 84.5 | 85.7 | 83.4 | 74.2 | 79.9 | 87.5 |
Agentic Search Tasks
| Benchmark | Setting | K2 Thinking | GPT-5 (High) |
Claude Sonnet 4.5 (Thinking) |
K2 0905 | DeepSeek-V3.2 |
|---|---|---|---|---|---|---|
| BrowseComp | w/ tools | 60.2 | 54.9 | 24.1 | 7.4 | 40.1 |
| BrowseComp-ZH | w/ tools | 62.3 | 63.0* | 42.4* | 22.2 | 47.9 |
| Seal-0 | w/ tools | 56.3 | 51.4* | 53.4* | 25.2 | 38.5* |
| FinSearchComp-T3 | w/ tools | 47.4 | 48.5* | 44.0* | 10.4 | 27.0* |
| Frames | w/ tools | 87.0 | 86.0* | 85.0* | 58.1 | 80.2* |
- Downloads last month
- 194
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for ling1000T/Kimi-K2-Thinking-gguf
Base model
moonshotai/Kimi-K2-Thinking