15 41 4

Peng Xia

richardxp888

https://richard-peng-xia.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

authored a paper 1 day ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

upvoted a paper 1 day ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

View all activity

Organizations

upvoted a paper about 4 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 3 days ago • 193

upvoted a paper 1 day ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published 4 days ago • 28

upvoted a paper 6 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published 7 days ago • 25

upvoted a paper 21 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 22 days ago • 136

upvoted a paper about 1 month ago

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 44

upvoted 3 papers about 2 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 350

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

upvoted a paper 2 months ago

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Paper • 2602.05258 • Published Feb 5 • 7

upvoted 2 papers 3 months ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 59

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37

upvoted 3 papers 4 months ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 51

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published Nov 25, 2025 • 49

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 92

upvoted 5 papers 5 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

Paper • 2510.06014 • Published Oct 7, 2025 • 10

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation

Paper • 2510.09724 • Published Oct 10, 2025 • 11

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 100

upvoted a paper 6 months ago

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20, 2025 • 35

Peng Xia

AI & ML interests

Recent Activity

Organizations

richardxp888's activity