Improve model card: Add pipeline tag, library name, and update CE-GPPO paper/code links

by nielsr HF Staff - opened Sep 27, 2025

←

This PR enhances the model card for Kwai-Klear/Klear-Reasoner-8B, which is based on the CE-GPPO method.

Key improvements include:

Adding pipeline_tag: text-generation to improve discoverability on the Hugging Face Hub.
Specifying library_name: transformers to enable the automated "how to use" widget, as the model is compatible with the transformers library.
Updating the primary paper links in the model card to point to the CE-GPPO paper (CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning).
Updating the GitHub link to the official CE-GPPO repository (https://github.com/Kwai-Klear/CE-GPPO).
Updating the "Training" section to reflect the instructions from the CE-GPPO GitHub repository for better consistency and accuracy.
Updating the citation section to reference the correct CE-GPPO paper.

Please review and merge this PR if everything looks good.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment