TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimization Paper • 2605.20150 • Published 4 days ago • 6
RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting Paper • 2605.18263 • Published 5 days ago • 8
UniT: Unified Geometry Learning with Group Autoregressive Transformer Paper • 2605.21131 • Published 3 days ago • 5
RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments Paper • 2604.26067 • Published 25 days ago • 73
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons Paper • 2604.28130 • Published 23 days ago • 22
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 26 days ago • 118
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 29 days ago • 63
FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing Paper • 2604.22586 • Published 29 days ago • 16
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 29 days ago • 226
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published about 1 month ago • 25
LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics Paper • 2604.17295 • Published Apr 19 • 85
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published about 1 month ago • 19
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published about 1 month ago • 36
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published Apr 15 • 162
Learning Long-term Motion Embeddings for Efficient Kinematics Generation Paper • 2604.11737 • Published Apr 13 • 6
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published Apr 13 • 72