RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models Paper • 2603.21341 • Published 23 days ago • 23
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 22 days ago • 46
RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning Paper • 2602.18742 • Published Feb 21 • 11
Vision-aligned Latent Reasoning for Multi-modal Large Language Model Paper • 2602.04476 • Published Feb 4 • 14
Contrastive Representation Regularization for Vision-Language-Action Models Paper • 2510.01711 • Published Oct 2, 2025 • 4
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Paper • 2407.15841 • Published Jul 22, 2024 • 39