mlfoundations-cua-dev/ubuntu_traj_rl_bbox_max_500_samples_max_turns_10 Viewer • Updated Nov 3 • 1k • 57
mlfoundations-cua-dev/ubuntu_traj_rl_bbox_max_500_samples_max_turns_10 Viewer • Updated Nov 3 • 1k • 57
🍨 Gelato Collection From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents • 5 items • Updated 23 days ago
🍨 Gelato-30B-A3B Checkpoints Collection Intermediate checkpoints for Gelato-30B-A3B. Refer to https://github.com/mlfoundations/gelato for more details. • 29 items • Updated Oct 29