ZhangXiaoyun
DadaCloud01
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its
Potential for LLM Reinforcement Learning authored a paper 3 days ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters