Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 21 hours ago

lewtun/parameter-golf-experiments

published a Space about 21 hours ago

lewtun/parameter-golf-experiments

liked a dataset 2 days ago

willdepueoai/parameter-golf

View all activity

Organizations

submitted 2 papers to Daily Papers about 1 month ago

Single-minus gluon tree amplitudes are nonzero

Paper • 2602.12176 • Published Feb 12 • 8

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 12

authored a paper 10 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15, 2025 • 6

authored a paper 12 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 207

authored 2 papers about 1 year ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 48

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 257

authored 3 papers about 2 years ago

RAFT: A Real-World Few-Shot Text Classification Benchmark

Paper • 2109.14076 • Published Sep 28, 2021 • 2

Efficient Few-Shot Learning Without Prompts

Paper • 2209.11055 • Published Sep 22, 2022 • 4

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Paper • 2206.11249 • Published Jun 22, 2022

authored a paper over 2 years ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123

authored a paper almost 3 years ago

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

Paper • 2303.12582 • Published Mar 22, 2023 • 23

authored a paper about 3 years ago

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

Paper • 2210.01970 • Published Sep 30, 2022 • 13