Active filters: RLHF
OpenAssistant/reward-model-deberta-v3-large-v2 • Text Classification • 8.84k • 237
NousResearch/Nous-Hermes-2-Mistral-7B-DPO • Text Generation • 7B • 4.11k • 217
aaditya/Llama3-OpenBioLLM-70B • Text Generation • 2.48k • 489
NiuTrans/robust_visual_reward_model
wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel • Text Generation • 8B • 18 • 1
RMSnow/SpeechJudge-GRM • 11B • 48 • 2
OpenAssistant/reward-model-deberta-v3-base • Text Classification • 882 • 13
OpenAssistant/reward-model-electra-large-discriminator • Text Classification • 19 • 5
OpenAssistant/reward-model-deberta-v3-large • Text Classification • 722 • 26
ChaiML/gpt2_base_retry_and_continue_12m_reward_model • Text Classification • 21 • 2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model • Text Classification • 9
ChaiML/gpt2_large_retry_and_continue_12m_reward_model • Text Classification • 9
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model • Text Classification • 17 • 1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model • Text Classification • 20 • 4
llm-blender/pair-ranker • Text Ranking • 0.4B • 18 • 3
nicholasKluge/RewardModelPT • Text Classification • 0.1B • 62
nicholasKluge/RewardModel • Text Classification • 0.1B • 77 • 1
fb700/chatglm-fitness-RLHF • 268
fb700/Bofan-chatglm-Best-lora • 14 • 11
kubernetes-bad/Ligma-L2-13b • 15 • 3
llm-blender/PairRM • Text Generation • 444 • 205
berkeley-nest/Starling-LM-7B-alpha • Text Generation • 7B • 1.19k • 554
berkeley-nest/Starling-RM-7B-alpha • 43 • 103
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • 7
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • 8 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • 6 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • 6 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • 6 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • 7B • 3.07k • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • 1B • 22 • 9