PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published 11 days ago • 24
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Paper • 2512.05081 • Published 2 days ago • 4
LATTICE: Democratize High-Fidelity 3D Generation at Scale Paper • 2512.03052 • Published 13 days ago • 7
Generative Neural Video Compression via Video Diffusion Prior Paper • 2512.05016 • Published 2 days ago • 7
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published 2 days ago • 10
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published 2 days ago • 31
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published 3 days ago • 13
NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation Paper • 2512.05106 • Published 2 days ago • 11
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model Paper • 2512.01030 • Published 6 days ago • 16
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression Paper • 2512.00891 • Published 6 days ago • 14
CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation Paper • 2512.03540 • Published 4 days ago • 11
Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation Paper • 2512.03534 • Published 4 days ago • 17
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation Paper • 2512.03036 • Published 4 days ago • 20
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published 4 days ago • 57
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout Paper • 2511.20649 • Published 11 days ago • 43
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning Paper • 2511.20549 • Published 11 days ago • 23
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 9 days ago • 145
Canvas-to-Image: Compositional Image Generation with Multimodal Controls Paper • 2511.21691 • Published 10 days ago • 32