Reward Models
updated
nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual
Text Generation
• 71B • Updated • 41
• • 11
nvidia/Llama-3.3-Nemotron-70B-Reward-Principle
Text Generation
• 71B • Updated • 302
• 7
nvidia/Qwen-3-Nemotron-32B-Reward
Text Classification
• 32B • Updated • 87
• 20
Skywork/Skywork-Reward-V2-Llama-3.1-8B
Text Classification
• 8B • Updated • 29.6k
• 44
Text Classification
• 8B • Updated • 140
• 9
allenai/Llama-3.1-70B-Instruct-RM-RB2
Text Classification
• Updated • 33
• 1
allenai/Llama-3.1-8B-Instruct-RM-RB2
Text Classification
• Updated • 205
• 1
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
• 8B • Updated • 17.5k
• 184
nvidia/Llama-3.3-Nemotron-70B-Select
Text Generation
• 71B • Updated • 43
• • 12
nvidia/Llama-3.3-Nemotron-70B-Edit
Text Generation
• 71B • Updated • 33
• • 4
nvidia/Llama-3.3-Nemotron-70B-Feedback
Text Generation
• 71B • Updated • 39
• • 9
allenai/Llama-3.1-Tulu-3-8B-RM
Text Classification
• 8B • Updated • 1.73k
• 19
Text Classification
• 73B • Updated • 68.1k
• 83
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
• 8B • Updated • 147
• 25
NCSOFT/Llama-3-OffsetBias-8B
Text Generation
• 8B • Updated • 11
• • 15
nvidia/Qwen2.5-CascadeRL-RM-72B
Text Generation
• 71B • Updated • 1.85k
• 13
general-preference/GPM-Llama-3.1-8B
8B • Updated • 22
• 1