Xu Yifan's picture

3 6

Xu Yifan

xuyifan

·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation

authored a paper about 1 month ago

AlignBench: Benchmarking Chinese Alignment of Large Language Models

authored a paper about 1 month ago

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

View all activity

Organizations

None yet

authored 12 papers about 1 month ago

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation

Paper • 2303.14655 • Published Mar 26, 2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Paper • 2311.18743 • Published Nov 30, 2023 • 1

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

Paper • 2306.07906 • Published Jun 13, 2023 • 13

AgentBench: Evaluating LLMs as Agents

Paper • 2308.03688 • Published Aug 7, 2023 • 25

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22

GLM-130B: An Open Bilingual Pre-trained Model

Paper • 2210.02414 • Published Oct 5, 2022 • 3

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 33

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

AutoGLM: Autonomous Foundation Agents for GUIs

Paper • 2411.00820 • Published Oct 28, 2024 • 2

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 49

AndroidGen: Building an Android Language Agent under Data Scarcity

Paper • 2504.19298 • Published Apr 27

AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Paper • 2510.04206 • Published Oct 5 • 2

upvoted a paper 5 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 240

upvoted a paper 7 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

upvoted a paper 12 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

upvoted 2 papers about 1 year ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 36

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 49

New activity in meta-llama/Llama-3.2-11B-Vision about 1 year ago

Error encountered when fine-tuning

#30 opened about 1 year ago by

commented a paper over 1 year ago

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22 •

upvoted a paper over 1 year ago

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22