Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning • Paper • 2604.12374 • Published 3 days ago • 26
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe • Paper • 2604.13016 • Published 3 days ago • 71
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents • Paper • 2604.11784 • Published 4 days ago • 128
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs • Paper • 2604.10480 • Published 5 days ago • 19
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music • Paper • 2604.10905 • Published 4 days ago • 25
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents • Paper • 2603.27490 • Published 19 days ago • 17
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web • Paper • 2604.08516 • Published 8 days ago • 41
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents • Paper • 2604.07430 • Published 9 days ago • 181
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published 9 days ago • 310
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published 8 days ago • 276
ClawBench: Can AI Agents Complete Everyday Online Tasks? • Paper • 2604.08523 • Published 8 days ago • 255
LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures • Paper • 2509.14252 • Published Sep 11, 2025 • 6
V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning • Paper • 2603.14482 • Published Mar 15 • 30
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language • Paper • 2512.10942 • Published Dec 11, 2025 • 56
UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory • Paper • 2602.10652 • Published Feb 11 • 4
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding • Paper • 2604.05015 • Published 11 days ago • 232
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control • Paper • 2604.06156 • Published 10 days ago • 10