view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge NormalUhr • Feb 7, 2025 • 292
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.12k
view article Article Open-Source Handwritten Signature Detection Model samuellimabraz • Mar 14, 2025 • 121
view article Article Hugging Face welcomes the Aya Expanse family of multilingual models ariG23498 • Oct 24, 2024 • 10
Traditional Chinese LLM Corpus Collection Traditional Chinese corpus collection for LLM training (pre-training, instruction-tuning, and RLHF/alignment). • 21 items • Updated Mar 2 • 8
view article Article How we leveraged distilabel to create an Argilla 2.0 Chatbot +3 plaguss, gabrielmbmb, sdiazlor, osanseviero, dvilasuero • Jul 16, 2024 • 33
📚Traditional Chinese Translation Dataset Collection 收集繁體中文在語言模型上存在多國語言翻譯的資料集,例如:中轉英、中轉越南等。繁體中文與東亞、東南亞關係密切,需考量未來延展性 • 3 items • Updated Feb 2, 2024 • 3