DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning Paper • 2603.12257 • Published 2 days ago • 26 • 2
SoundWeaver: Semantic Warm-Starting for Text-to-Audio Diffusion Serving Paper • 2603.07865 • Published 6 days ago • 2 • 3
Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition Paper • 2603.12046 • Published 2 days ago • 1 • 2
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 2 days ago • 47 • 2
PACED: Distillation at the Frontier of Student Competence Paper • 2603.11178 • Published 3 days ago • 3 • 2
The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training Paper • 2603.10444 • Published 4 days ago • 6 • 2
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams Paper • 2603.12265 • Published 2 days ago • 8 • 2
Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training Paper • 2603.12246 • Published 2 days ago • 4 • 2
A Mixed Diet Makes DINO An Omnivorous Vision Encoder Paper • 2602.24181 • Published 15 days ago • 1 • 2
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks Paper • 2603.11487 • Published 3 days ago • 1 • 2
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge Paper • 2603.11665 • Published 3 days ago • 2 • 1
FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System Paper • 2603.10420 • Published 4 days ago • 3 • 2
DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 2 days ago • 16 • 2
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use Paper • 2603.11076 • Published 4 days ago • 4 • 2
GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing Paper • 2603.12264 • Published 2 days ago • 14 • 2
Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning Paper • 2603.11653 • Published 3 days ago • 2 • 2
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper • 2603.12247 • Published 2 days ago • 22 • 2