arxiv:2603.16448
ZhangXiaoyun
DadaCloud01
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its
Potential for LLM Reinforcement Learning authored a paper 2 days ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters