Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9 • 125
Learning to See and Act: Task-Aware View Planning for Robotic Manipulation Paper • 2508.05186 • Published Aug 7
ObjectClear: Complete Object Removal via Object-Effect Attention Paper • 2505.22636 • Published May 28 • 2
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Paper • 2406.18516 • Published Jun 26, 2024 • 4
ObjCtrl-2.5D: Training-free Object Control with Camera Poses Paper • 2412.07721 • Published Dec 10, 2024 • 8
Image Conductor: Precision Control for Interactive Video Synthesis Paper • 2406.15339 • Published Jun 21, 2024 • 9
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Paper • 2406.16863 • Published Jun 24, 2024 • 11