ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models • Paper • 2603.19466 • Published 4 days ago • 25
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation • Paper • 2603.19039 • Published 4 days ago • 42
Specificity-aware reinforcement learning for fine-grained open-world classification • Paper • 2603.03197 • Published 20 days ago • 15
Large Multimodal Models as General In-Context Classifiers • Paper • 2602.23229 • Published 25 days ago • 26
How to Take a Memorable Picture? Empowering Users with Actionable Feedback • Paper • 2602.21877 • Published 26 days ago • 16