-
-
-
-
-
-
Inference Providers
Active filters:
rl
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
1
Text Generation
•
4B
•
Updated
•
2
Text Generation
•
4B
•
Updated
•
1
HarleyCooper/Qwen3-30B-Dakota1890
Text Generation
•
Updated
•
2
HerrHruby/offline_acemath_rl_4b_inst_hard_with_dishsoap_16k_no_summ_curr_step_120
Text Generation
•
4B
•
Updated
•
27
HarleyCooper/Qwen3-30B-ThinkingMachines-Dakota1890
Reinforcement Learning
•
Updated
•
1
Text Generation
•
21B
•
Updated
•
24
mradermacher/CAI-20B-v2-GGUF
Text Generation
•
21B
•
Updated
•
44
mradermacher/CAI-20B-v2-i1-GGUF
Text Generation
•
21B
•
Updated
•
144
socaitcy/SOCAIT-Hermes-14B
Text Generation
•
Updated
ash256/qwen3-4b-question-gen
Text Generation
•
4B
•
Updated
•
4
•
1
pankajmathur/nanochat-d34-rl-all-ckpts
Text Generation
•
Updated
•
1
pankajmathur/nanochat-d34-rl
Text Generation
•
Updated
HallD/SkeptiSTEM-4B-v2-stageR3-grpo-lora
Text Generation
•
Updated
Any-to-Any
•
7B
•
Updated
•
438
ModalityDance/Omni-R1-Zero
Any-to-Any
•
7B
•
Updated
•
328
ibrahima2222/nanochat-d32
Updated
IIGroup/X-Coder-RL-Qwen2.5-7B
8B
•
Updated
•
77
•
1
IIGroup/X-Coder-RL-Qwen3-8B
8B
•
Updated
•
78
•
1
mradermacher/X-Coder-RL-Qwen3-8B-GGUF
8B
•
Updated
•
292
mradermacher/X-Coder-RL-Qwen2.5-7B-GGUF
mradermacher/X-Coder-RL-Qwen3-8B-i1-GGUF
8B
•
Updated
•
1.09k
•
1
mradermacher/X-Coder-RL-Qwen2.5-7B-i1-GGUF
8B
•
Updated
•
241
Text Generation
•
4B
•
Updated
•
241