Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability • Paper • 2601.18778 • Published 5 days ago • 38
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale • Paper • 2512.10398 • Published Dec 11, 2025 • 13
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints • Paper • 2601.18137 • Published 5 days ago • 23
Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification • Paper • 2601.15808 • Published 9 days ago • 20
Endless Terminals: Scaling RL Environments for Terminal Agents • Paper • 2601.16443 • Published 8 days ago • 16
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents • Paper • 2601.16746 • Published 8 days ago • 86
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model • Paper • 2601.15892 • Published 9 days ago • 53
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics • Paper • 2601.14027 • Published 11 days ago • 12
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning • Paper • 2601.14750 • Published 10 days ago • 17
Rethinking Video Generation Model for the Embodied World • Paper • 2601.15282 • Published 10 days ago • 42
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge • Paper • 2601.08808 • Published 18 days ago • 38
SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations • Paper • 2512.14080 • Published Dec 16, 2025 • 8
An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications • Paper • 2509.19185 • Published Sep 23, 2025 • 4