Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
RLHFlow
's Collections
Reinforce-Ada
Minimal-RL
Online-DPO-R1
Decision-Tree Reward Models
RLHFlow MATH Process Reward Model
Standard-format-preference-dataset
Mixture-of-preference-reward-modeling
RM-Bradley-Terry
PM-pair
Online RLHF
RLHFLow Reward Models
SFT Models
Decision-Tree Reward Models
updated
Mar 2
Upvote
1
RLHFlow/Decision-Tree-Reward-Gemma-2-27B
Text Classification
•
27B
•
Updated
Jan 24, 2025
•
29
•
8
RLHFlow/Decision-Tree-Reward-Llama-3.1-8B
Text Classification
•
8B
•
Updated
Jan 24, 2025
•
50
•
7
RLHFlow/LLM-Preferences-HelpSteer2
Viewer
•
Updated
Feb 5, 2025
•
9.13k
•
102
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections