Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
princeton-nlp
's Collections
RLMT Experiments
SimPO
SWE-bench
ProLong
Sheared Llama
SimCSE
SimPO
updated
Mar 16, 2025
This collections contains a list of SimPO and baseline models.
Upvote
24
+14
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation
•
9B
•
Updated
Aug 2, 2024
•
634
•
•
172
princeton-nlp/gemma-2-9b-it-DPO
Text Generation
•
9B
•
Updated
Jul 18, 2024
•
25
•
•
9
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
61
•
•
1
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
634
•
princeton-nlp/Llama-3-Base-8B-SFT-KTO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
50
•
princeton-nlp/Llama-3-Base-8B-SFT-ORPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
46
•
princeton-nlp/Llama-3-Base-8B-SFT-RDPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
56
•
princeton-nlp/Llama-3-Base-8B-SFT-SimPO
Text Generation
•
8B
•
Updated
May 24, 2024
•
185
•
•
1
princeton-nlp/Llama-3-Base-8B-SFT
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
1.44k
•
•
4
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
78
•
•
60
princeton-nlp/Llama-3-Instruct-8B-IPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
21
•
princeton-nlp/Llama-3-Instruct-8B-KTO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
20
•
princeton-nlp/Llama-3-Instruct-8B-ORPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
23
•
princeton-nlp/Llama-3-Instruct-8B-RDPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
25
•
princeton-nlp/Llama-3-Instruct-8B-DPO
Text Generation
•
8B
•
Updated
Jun 17, 2024
•
90
•
princeton-nlp/Mistral-7B-Instruct-RDPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
28
princeton-nlp/Mistral-7B-Instruct-DPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
28
princeton-nlp/Mistral-7B-Instruct-IPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
25
princeton-nlp/Mistral-7B-Instruct-KTO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
26
princeton-nlp/Mistral-7B-Instruct-SimPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
27
•
2
princeton-nlp/Mistral-7B-Instruct-ORPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
19
princeton-nlp/Mistral-7B-Base-SFT-IPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
20
princeton-nlp/Mistral-7B-Base-SFT-KTO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
21
princeton-nlp/Mistral-7B-Base-SFT-DPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
19
princeton-nlp/Mistral-7B-Base-SFT-RDPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
27
princeton-nlp/Mistral-7B-Base-SFT-SimPO
Text Generation
•
7B
•
Updated
Jun 17, 2024
•
166
princeton-nlp/llama3-ultrafeedback
Viewer
•
Updated
Jul 18, 2024
•
61.8k
•
775
•
18
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation
•
7B
•
Updated
Sep 30, 2024
•
18
•
1
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation
•
7B
•
Updated
Sep 30, 2024
•
23
princeton-nlp/Mistral-7B-Base-SFT-SLiC-HF
Text Generation
•
7B
•
Updated
Jul 7, 2024
•
25
princeton-nlp/Mistral-7B-Instruct-CPO
Text Generation
•
7B
•
Updated
Jul 7, 2024
•
21
princeton-nlp/Mistral-7B-Instruct-RRHF
Text Generation
•
7B
•
Updated
Jul 7, 2024
•
23
princeton-nlp/Mistral-7B-Instruct-SLiC-HF
Text Generation
•
7B
•
Updated
Jul 7, 2024
•
30
princeton-nlp/Llama-3-Base-8B-SFT-CPO
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
43
•
princeton-nlp/Llama-3-Base-8B-SFT-RRHF
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
39
•
princeton-nlp/Llama-3-Base-8B-SFT-SLiC-HF
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
45
•
princeton-nlp/Llama-3-Instruct-8B-CPO
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
22
•
princeton-nlp/Llama-3-Instruct-8B-RRHF
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
21
•
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
21
•
princeton-nlp/Llama-3-Instruct-8B-RRHF-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
24
•
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
24
•
princeton-nlp/Llama-3-Instruct-8B-DPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
24
•
princeton-nlp/Llama-3-Instruct-8B-IPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
24
•
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
22
•
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
21
•
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
27
•
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
20
•
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation
•
8B
•
Updated
Jul 7, 2024
•
283
•
•
8
princeton-nlp/llama3-ultrafeedback-armorm
Viewer
•
Updated
Jul 18, 2024
•
61.8k
•
814
•
20
Upvote
24
+20
Share collection
View history
Collection guide
Browse collections