2 11 2

Yuqian Fu

Yuqian-Fu

AI & ML interests

None yet

Recent Activity

upvoted a collection 5 days ago

📝 Research & Long-Form Blog Posts

upvoted a paper 4 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

updated a collection 4 months ago

SRFT

View all activity

Organizations

None yet

upvoted a collection 5 days ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 9 items • Updated 10 days ago • 15

upvoted a paper 4 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174

updated a collection 4 months ago

SRFT

Collection

5 items • Updated Sep 28, 2025

published 2 models 4 months ago

Yuqian-Fu/SRFT-Qwen2.5-Math-1.5B

2B • Updated Jul 24, 2025 • 2

Yuqian-Fu/SRFT-Qwen2.5-7B-Instruct

8B • Updated Jul 24, 2025

upvoted a paper 4 months ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11, 2025 • 34

authored a paper 4 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47

upvoted 3 papers 4 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1, 2025 • 57

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4, 2025 • 57

upvoted 2 papers 5 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 83

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 110

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.65k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50

updated 3 models 6 months ago

upvoted a paper 6 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 259

liked a dataset 6 months ago

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31, 2025 • 228k • 95k • 788

Yuqian Fu

AI & ML interests

Recent Activity

Organizations

Yuqian-Fu's activity

The Ultra-Scale Playbook