-
-
-
-
-
-
Inference Providers
Active filters:
open-r1
HayatoHongoEveryonesAI/llm-jp-4-8b-instruct-sft-long-v5
Text Generation
•
266k
•
Updated
•
11
•
2
Text Generation
•
8B
•
Updated
•
3
•
1
Text Generation
•
8B
•
Updated
•
5
•
1
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
8
yucaiwen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
5
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
•
8B
•
Updated
•
7
•
1
JinnP/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
3
bangan/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
6
liusq19/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
3
stepyoun/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
5
howey/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
7
wxnfifth/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
5
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
•
8B
•
Updated
•
11
Text Generation
•
8B
•
Updated
•
5
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
•
2B
•
Updated
•
8
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
•
2B
•
Updated
•
8
skzxjus/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
5
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B
•
Updated
•
570
skzxjus/Qwen2.5-7B-1m-Open-R1-Distill
Text Generation
•
8B
•
Updated
•
12
•
4
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
•
8B
•
Updated
•
8
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
5
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
8B
•
Updated
•
361
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
•
8B
•
Updated
•
6
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
2B
•
Updated
•
96
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
8B
•
Updated
•
398
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
•
8B
•
Updated
•
4
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
•
8B
•
Updated
•
4
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
•
2B
•
Updated
•
3
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
•
2B
•
Updated
•
2
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
4