Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
xuxin
xx18
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
updated
a model
about 1 month ago
xx18/TFPI-Qwen3-4B-Thinking-2507-Stage3