Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 3 days ago • 27 • 4
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published 1 day ago • 26 • 3
ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation Paper • 2604.03922 • Published 4 days ago • 41 • 3
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published 1 day ago • 88 • 3
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published 3 days ago • 191 • 6
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper • 2604.04911 • Published 3 days ago • 28 • 3
AURA: Always-On Understanding and Real-Time Assistance via Video Streams Paper • 2604.04184 • Published 4 days ago • 41 • 3
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 3 days ago • 78 • 4
LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models Paper • 2603.28301 • Published 10 days ago • 74 • 5
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published 3 days ago • 97 • 4
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 3 days ago • 162 • 12
Communicating about Space: Language-Mediated Spatial Integration Across Partial Views Paper • 2603.27183 • Published 11 days ago • 15 • 3
Test-Time Scaling Makes Overtraining Compute-Optimal Paper • 2604.01411 • Published 8 days ago • 19 • 4
Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence? Paper • 2604.03016 • Published 6 days ago • 27 • 3
Token Warping Helps MLLMs Look from Nearby Viewpoints Paper • 2604.02870 • Published 6 days ago • 26 • 4
A Simple Baseline for Streaming Video Understanding Paper • 2604.02317 • Published 7 days ago • 65 • 6
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 7 days ago • 29 • 5
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark Paper • 2603.26017 • Published 13 days ago • 31 • 3