FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring Paper • 2512.04390 • Published 3 days ago • 4
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published 3 days ago • 13
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published 2 days ago • 37
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 3 days ago • 126
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding Paper • 2512.04000 • Published 3 days ago • 2
PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation Paper • 2512.04025 • Published 3 days ago • 2
Light-X: Generative 4D Video Rendering with Camera and Illumination Control Paper • 2512.05115 • Published 2 days ago • 3
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment Paper • 2511.22345 • Published 9 days ago • 12
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation Paper • 2512.05076 • Published 2 days ago • 4
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs Paper • 2512.04746 • Published 2 days ago • 8
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral Paper • 2512.04220 • Published 3 days ago • 8
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 3 days ago • 135
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published 2 days ago • 10
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published 2 days ago • 60
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published 2 days ago • 11
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published 5 days ago • 6
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published 5 days ago • 29
4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer Paper • 2512.05060 • Published 2 days ago • 17
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published 3 days ago • 31