CodeGoat24
's Collections
UnifiedReward Training Data
updated
Unified Reward Model for Multimodal Understanding and Generation
Paper
•
2503.05236
•
Published
•
122
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement
Fine-Tuning
Paper
•
2505.03318
•
Published
•
92
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer
•
Updated
•
337k
•
192
CodeGoat24/ImageGen-CoT-Reward-5K
Viewer
•
Updated
•
5.54k
•
122
•
1
CodeGoat24/LLaVA-Critic-113k
Preview
•
Updated
•
176
Viewer
•
Updated
•
21.4k
•
74
CodeGoat24/ShareGPTVideo-DPO
Viewer
•
Updated
•
101k
•
70
Viewer
•
Updated
•
29k
•
181
Preview
•
Updated
•
153
Viewer
•
Updated
•
73.2k
•
51
Viewer
•
Updated
•
72.7k
•
64
Viewer
•
Updated
•
19k
•
59