FlashSampling: Fast and Memory-Efficient Exact Sampling Paper • 2603.15854 • Published 2 days ago • 3
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 2 days ago • 218
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 9 days ago • 63
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 7 days ago • 115
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 Feb 12 • 31
view article Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST 28 days ago • 18
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 2 days ago • 32
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 27 days ago • 487
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents Paper • 2512.12730 • Published Dec 14, 2025 • 50
When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 193
SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia Paper • 2602.01618 • Published Feb 2 • 2
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 61