BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration Paper β’ 2510.00438 β’ Published Oct 1, 2025 β’ 9
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B β’ 37 items β’ Updated Sep 18, 2025 β’ 57
FastVLM Collection Efficient Vision Encoding for Vision Language Models β’ 9 items β’ Updated Sep 2, 2025 β’ 106
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper β’ 2508.10881 β’ Published Aug 14, 2025 β’ 52
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper β’ 2508.01191 β’ Published Aug 2, 2025 β’ 238
EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion Paper β’ 2507.16535 β’ Published Jul 22, 2025 β’ 20
Seed-X Collection A powerful open-source multilingual translation language model series, including instruction and reasoning models. β’ 8 items β’ Updated Aug 22, 2025 β’ 66
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation Paper β’ 2506.21416 β’ Published Jun 26, 2025 β’ 28
view article Article π€ππ¬π₯οΈπ Kimi-VL-A3B-Thinking-2506: A Quick Navigation Jun 21, 2025 β’ 74
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. β’ 9 items β’ Updated about 22 hours ago β’ 376
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 β’ 7 items β’ Updated 13 days ago β’ 161
SkyReels-A2: Compose Anything in Video Diffusion Transformers Paper β’ 2504.02436 β’ Published Apr 3, 2025 β’ 39
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper β’ 2503.11647 β’ Published Mar 14, 2025 β’ 146
Wan2.1 14B 480p I2V LoRAs Collection A collection of Remade's Wan2.1 14B 480p I2V LoRAs β’ 49 items β’ Updated May 24, 2025 β’ 208
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org β’ 12 items β’ Updated 21 days ago β’ 141