One Model, Many Latencies: Universal Speech Enhancement for Diverse Real-Time Applications Paper • 2606.25621 • Published 3 days ago • 13
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 11 days ago • 61
BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering Paper • 2606.17049 • Published 12 days ago • 27
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 16 days ago • 106
Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models Paper • 2606.12412 • Published 17 days ago • 20
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 30 days ago • 55
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published about 1 month ago • 93
Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion Paper • 2605.25449 • Published May 25 • 21
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching Paper • 2602.12280 • Published Feb 12 • 34
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published Jan 14 • 26
3AM: Segment Anything with Geometric Consistency in Videos Paper • 2601.08831 • Published Jan 13 • 34
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published Jan 8 • 48
Generative Refocusing: Flexible Defocus Control from a Single Image Paper • 2512.16923 • Published Dec 18, 2025 • 39
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery Paper • 2510.15869 • Published Oct 17, 2025 • 50
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal Paper • 2510.15868 • Published Oct 17, 2025 • 27
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions Paper • 2510.02314 • Published Oct 2, 2025 • 61
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Paper • 2509.22653 • Published Sep 26, 2025 • 25
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19, 2025 • 59
GCC: Generative Color Constancy via Diffusing a Color Checker Paper • 2502.17435 • Published Feb 24, 2025 • 30
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Paper • 2502.05176 • Published Feb 7, 2025 • 40