Shaobai Jiang's picture

Shaobai Jiang

shaobaij

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

upvoted a paper about 10 hours ago

SERA: Soft-Verified Efficient Repository Agents

upvoted a paper about 11 hours ago

Advancing Open-source World Models

View all activity

Organizations

None yet

upvoted 2 papers about 10 hours ago

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 5 days ago • 38

SERA: Soft-Verified Efficient Repository Agents

Paper • 2601.20789 • Published 3 days ago • 8

upvoted a paper about 11 hours ago

Advancing Open-source World Models

Paper • 2601.20540 • Published 3 days ago • 90

upvoted 2 papers 2 days ago

A Pragmatic VLA Foundation Model

Paper • 2601.18692 • Published 5 days ago • 42

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 8 days ago • 166

upvoted 2 papers 3 days ago

Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale

Paper • 2512.10398 • Published Dec 11, 2025 • 13

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published 5 days ago • 23

upvoted 3 papers 4 days ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published 9 days ago • 20

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published 8 days ago • 16

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

Paper • 2601.16746 • Published 8 days ago • 86

upvoted 3 papers 5 days ago

Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model

Paper • 2601.15892 • Published 9 days ago • 53

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

Paper • 2601.14027 • Published 11 days ago • 12

Learning to Discover at Test Time

Paper • 2601.16175 • Published 9 days ago • 41

upvoted 3 papers 6 days ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Paper • 2601.14750 • Published 10 days ago • 17

Rethinking Video Generation Model for the Embodied World

Paper • 2601.15282 • Published 10 days ago • 42

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published 9 days ago • 82

upvoted 4 papers 7 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published 18 days ago • 38

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 13 days ago • 186

SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations

Paper • 2512.14080 • Published Dec 16, 2025 • 8

An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications

Paper • 2509.19185 • Published Sep 23, 2025 • 4