Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published 2 days ago • 42
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction Paper • 2509.07403 • Published Sep 9, 2025 • 34
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16, 2025 • 91
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published Jul 3, 2025 • 123
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published May 21, 2025 • 49
Running 3.66k The Ultra-Scale Playbook 🌌 3.66k The ultimate guide to training LLM on large GPU Clusters