-
-
-
-
-
-
Inference Providers
Active filters:
RLinf
Reinforcement Learning
•
2B
•
Updated
•
5
•
1
Text Generation
•
8B
•
Updated
•
3
•
3
mradermacher/RLinf-math-1.5B-GGUF
2B
•
Updated
•
40
mradermacher/RLinf-math-7B-GGUF
Reinforcement Learning
•
8B
•
Updated
•
41
•
1
mradermacher/RLinf-math-1.5B-i1-GGUF
2B
•
Updated
•
117
mradermacher/RLinf-math-7B-i1-GGUF
Reinforcement Learning
•
8B
•
Updated
•
105
•
1
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-object
Reinforcement Learning
•
8B
•
Updated
•
4
RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood
Reinforcement Learning
•
8B
•
Updated
•
2
RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood
Reinforcement Learning
•
8B
•
Updated
•
3
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-goal
Reinforcement Learning
•
8B
•
Updated
•
1
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-spatial
Reinforcement Learning
•
8B
•
Updated
•
5
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-long
Reinforcement Learning
•
8B
•
Updated
•
1
RLinf/RLinf-OpenVLA-PPO-ManiSkill3-25ood
Reinforcement Learning
•
8B
•
Updated
•
4
RLinf/RLinf-OpenVLAOFT-PPO-ManiSkill3-25ood
Reinforcement Learning
•
8B
•
Updated
•
5
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Lora
Reinforcement Learning
•
Updated
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-90
Reinforcement Learning
•
8B
•
Updated
•
2
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning
•
8B
•
Updated
•
24
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning
•
8B
•
Updated
•
8
•
2
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning
•
8B
•
Updated
•
60