-
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Paper • 2503.22675 • Published • 36 -
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Paper • 2503.22230 • Published • 45 -
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Paper • 2509.13313 • Published • 80 -
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Paper • 2509.13309 • Published • 67
bypan
bypan123
·
AI & ML interests
None yet
Recent Activity
updated
a model
12 days ago
UBTECH-Robotics/Thinker-4B
published
a model
12 days ago
UBTECH-Robotics/Thinker-4B
upvoted
a
paper
19 days ago
Robix: A Unified Model for Robot Interaction, Reasoning and Planning