Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 7 days ago • 39
view post Post 401 PatchDNA, a DNA foundation model based on Meta's BLT tokenization strategy https://www.biorxiv.org/content/10.1101/2025.11.28.691095v1 See translation 🚀 1 1 + Reply
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 105
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 18
MeXtract: Light-Weight Metadata Extraction from Scientific Papers Paper • 2510.06889 • Published Oct 8, 2025 • 1
view post Post 461 Bio LLMs train on many genomes, but can we encode differences within a species? TomatoTomato adds pangenome tokens to represent a domestic tomato and a wild tomato in one sequence 🍅 🧬 monsoon-nlp/tomatotomato-gLM2-150M-v0.1 See translation 🚀 1 1 + Reply
NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models Paper • 2506.07731 • Published Jun 9, 2025 • 2
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 69
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8, 2025 • 75
PL-Guard: Benchmarking Language Model Safety for Polish Paper • 2506.16322 • Published Jun 19, 2025 • 1
ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT Paper • 2506.04929 • Published Jun 5, 2025 • 2
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 42
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 5
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews Paper • 2311.12474 • Published Nov 21, 2023 • 1