OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows Paper • 2510.24411 • Published Oct 28 • 71
R1-Fuzz: Specializing Language Models for Textual Fuzzing via Reinforcement Learning Paper • 2509.20384 • Published Sep 21 • 2
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published Oct 28 • 67
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis Paper • 2510.24695 • Published Oct 28 • 22
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions Paper • 2510.05318 • Published Oct 6 • 21
ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance Paper • 2312.08852 • Published Dec 13, 2023
FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction Paper • 2304.00902 • Published Apr 3, 2023
Towards General Agentic Intelligence via Environment Scaling Paper • 2509.13311 • Published Sep 16 • 71