KlearReasoner - a Kwai-Klear Collection

Kwai-Klear 's Collections

mini-swe-agent-plus

Klear-AgentForge

KlearReasoner

updated Dec 9, 2025

KlearReasoner

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

Paper • 2512.05591 • Published Dec 5, 2025 • 17
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Paper • 2509.20712 • Published Sep 25, 2025 • 20
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11, 2025 • 43
Kwai-Klear/Klear-Reasoner-8B

8B • Updated Sep 27, 2025 • 96 • 19
Kwai-Klear/KlearReasoner-MathSub-30K

Viewer • Updated Jan 6 • 30k • 112 • 3
Kwai-Klear/KlearReasoner-CodeSub-15K

Viewer • Updated Sep 27, 2025 • 15k • 36 • 7
Kwai-Klear/Klear-Reasoner-8B-SFT

8B • Updated Sep 27, 2025 • 29 • 2