Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Sneha7
/
phi2-helpfulness-grpo-demo
like
1
Runtime error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
phi2-helpfulness-grpo-demo
10.2 kB
1 contributor
History:
46 commits
Sneha7
Update grpo_train.py
c8e38f6
verified
5 days ago
.gitattributes
1.52 kB
initial commit
9 days ago
README.md
304 Bytes
Update README.md
9 days ago
app.py
1.59 kB
Update app.py
5 days ago
grpo_train.py
4.26 kB
Update grpo_train.py
5 days ago
policy.py
1.48 kB
Update policy.py
5 days ago
requirements.txt
72 Bytes
Update requirements.txt
8 days ago
reward_fn.py
937 Bytes
Create reward_fn.py
9 days ago