TAUR-dev/M-1110_star__oursfixed_alltask-rl
Organization: Text Analysis, Understanding, and Reasoning Development
Tags: Safetensors, English, qwen2
License: mit
Branch: main
Repository size: 3.57 GB
Contributors: 1
History: 2 commits
Latest commit: 87bf87c (verified) by Jacklu0831, about 2 months ago
Commit message: Upload rl RL model from experiment 1110_star__oursfixed_alltask
Files (each last changed about 2 months ago by the commit "Upload rl RL model from experiment 1110_star__oursfixed_alltask"):

File                       Size        Storage
.gitattributes             1.57 kB
README.md                  722 Bytes
added_tokens.json          605 Bytes
chat_template.jinja        2.51 kB
config.json                684 Bytes
generation_config.json     242 Bytes
merges.txt                 1.67 MB
model.safetensors          3.55 GB     xet
special_tokens_map.json    613 Bytes
tokenizer.json             11.4 MB     xet
tokenizer_config.json      4.71 kB
vocab.json                 2.78 MB