Zero-To-CAD Collection Datasets (1M & 100K) and model for synthesizing executable CAD programs from an LLM in a CadQuery environment. No real data used. • 3 items • Updated 9 days ago • 10
1930 Coder Collection Fine-tuning the Talkie 13B 1930 model on agentic trajectories • 3 items • Updated 2 days ago • 3
talkie-13b Collection talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated 14 days ago • 45
view article Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 60
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 153
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 142
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 28 days ago • 19
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 55
Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 53
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 Oct 29, 2024 • 61