ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers Paper • 2601.04342 • Published 5 days ago • 3
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published 6 days ago • 8
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published 5 days ago • 10
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 4 days ago • 24
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 4 days ago • 26
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published 4 days ago • 27
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 4 days ago • 39
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 4 days ago • 149
Unified Thinker: A General Reasoning Modular Core for Image Generation Paper • 2601.03127 • Published 6 days ago • 7
Parallel Latent Reasoning for Sequential Recommendation Paper • 2601.03153 • Published 6 days ago • 2
OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs Paper • 2601.01592 • Published 8 days ago • 11
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 6 days ago • 26
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 8 days ago • 35
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning Paper • 2512.23412 • Published 14 days ago • 36
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published 6 days ago • 42
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 6 days ago • 94