đ¤ Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing ⢠14 items ⢠Updated 26 days ago ⢠12
dLLM: Simple Diffusion Language Modeling Paper ⢠2602.22661 ⢠Published about 1 month ago ⢠151
Claude 4.5 Opus Collection Distilled models and datasets for Claude 4.5 Opus. ⢠14 items ⢠Updated 27 days ago ⢠30
Stable Code Collection Suite of developer assistant models ⢠5 items ⢠Updated Jan 9, 2025 ⢠45
PockEngine: Sparse and Efficient Fine-tuning in a Pocket Paper ⢠2310.17752 ⢠Published Oct 26, 2023 ⢠15