Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 2 days ago • 4 • 1
view changelog Hugging Face Changelog Repositories Usage Overview in Settings about 15 hours ago • 24
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 8 days ago • 172
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 15 days ago • 96 • 5
Running 381 Visualize Dataset (v2.0+ latest dataset format) 💻 381 Visualize LeRobot datasets in an interactive web tool
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 19 days ago • 94 • 3
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published Feb 13 • 34 • 3
view article Article Did GPT 5.2 make a breakthrough discovery in theoretical physics? 27 days ago • 61