KVAE 2.0 Collection KVAE 2.0 is a family of video tokenizers with a temporal compression ratio of 4 and spatial compression ratios of 8 and 16 • 2 items • Updated Apr 16 • 4
KVAE-Audio Collection KVAE-Audio is a continuous full-band audio waveform autoencoder • 1 item • Updated 4 days ago • 6
LTX-2.3 Creative Lab Collection LoRAs and IC-LoRAs, trained on the LTX-2.3 model • 19 items • Updated 8 days ago • 60
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation Paper • 2606.03972 • Published Jun 2 • 14
SwiftVR: Real-Time One-Step Generative Video Restoration Paper • 2606.09516 • Published 25 days ago • 17
Cosmos3 Collection Omnimodal World Models for Physical AI • 18 items • Updated about 14 hours ago • 137
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published May 27 • 75
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 5 items • Updated about 1 month ago • 3
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published May 25 • 7
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46
SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers Paper • 2605.22668 • Published May 21 • 41
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published May 18 • 93
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published May 13 • 105
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives Paper • 2605.12496 • Published May 12 • 30
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published May 12 • 194