A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 Paper • 2601.10527 • Published 3 days ago • 16
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published 9 days ago • 76
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 12 days ago • 121
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 13 days ago • 28
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Paper • 2511.22625 • Published Nov 27, 2025 • 46
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 227
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Paper • 2511.22625 • Published Nov 27, 2025 • 46
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation Paper • 2511.20635 • Published Nov 25, 2025 • 32
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation Paper • 2511.20635 • Published Nov 25, 2025 • 32