Kimi K2 Thinking gguf

One of best thinking models. modified-mit license (see moonshotai/Kimi-K2-Thinking)

Evaluation

Reasoning Tasks

Benchmark Setting K2 Thinking GPT-5
(High)
Claude Sonnet 4.5
(Thinking)
K2 0905 DeepSeek-V3.2 Grok-4
HLE (Text-only) no tools 23.9 26.3 19.8* 7.9 19.8 25.4
w/ tools 44.9 41.7* 32.0* 21.7 20.3* 41.0
heavy 51.0 42.0 - - - 50.7
AIME25 no tools 94.5 94.6 87.0 51.0 89.3 91.7
w/ python 99.1 99.6 100.0 75.2 58.1* 98.8
heavy 100.0 100.0 - - - 100.0
HMMT25 no tools 89.4 93.3 74.6* 38.8 83.6 90.0
w/ python 95.1 96.7 88.8* 70.4 49.5* 93.9
heavy 97.5 100.0 - - - 96.7
IMO-AnswerBench no tools 78.6 76.0* 65.9* 45.8 76.0* 73.1
GPQA no tools 84.5 85.7 83.4 74.2 79.9 87.5

Agentic Search Tasks

Benchmark Setting K2 Thinking GPT-5
(High)
Claude Sonnet 4.5
(Thinking)
K2 0905 DeepSeek-V3.2
BrowseComp w/ tools 60.2 54.9 24.1 7.4 40.1
BrowseComp-ZH w/ tools 62.3 63.0* 42.4* 22.2 47.9
Seal-0 w/ tools 56.3 51.4* 53.4* 25.2 38.5*
FinSearchComp-T3 w/ tools 47.4 48.5* 44.0* 10.4 27.0*
Frames w/ tools 87.0 86.0* 85.0* 58.1 80.2*
Downloads last month
194
GGUF
Model size
1T params
Architecture
deepseek2
Hardware compatibility
Log In to view the estimation

2-bit

3-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ling1000T/Kimi-K2-Thinking-gguf

Quantized
(13)
this model