6 31 10

Qiguang Chen

LightChen2333

https://lightchen233.github.io/

LightChen233

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Controlled Self-Evolution for Algorithmic Code Optimization

upvoted a paper 8 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper 10 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

View all activity

Organizations

upvoted 2 papers 8 days ago

Controlled Self-Evolution for Algorithmic Code Optimization

Paper • 2601.07348 • Published 11 days ago • 110

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 8 days ago • 121

upvoted a paper 10 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published 10 days ago • 26

upvoted a paper 11 days ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published 13 days ago • 49

upvoted an article 17 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

581

upvoted a paper about 1 month ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 64

upvoted 9 papers 3 months ago

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

Paper • 2510.06014 • Published Oct 7, 2025 • 10

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 212

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22, 2025 • 69

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 179

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 142

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 27

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10, 2025 • 53

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

Paper • 2510.05318 • Published Oct 6, 2025 • 22

upvoted 3 papers 4 months ago

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 20

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29, 2025 • 19

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

upvoted 2 papers 5 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20, 2025 • 85

Qiguang Chen

AI & ML interests

Recent Activity

Organizations

LightChen2333's activity

We Got Claude to Fine-Tune an Open Source LLM