PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference Paper • 2603.25730 • Published Mar 26 • 53
VibeVoice-Realtime-0.5B 🐨 Generate natural speech from text with selectable voices • Running on Zero • Agents • Featured • 184
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch • ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 258
Article LLaVA-o1: Let Vision Language Models Reason Step-by-Step • mikelabs • Nov 19, 2024 • 12
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 308