-
-
-
-
-
-
Inference Providers
Active filters:
orpo
cookinai/NeuralLlama-3-ORPO
Text Generation
•
Updated
•
8
MuntasirHossain/Meta-Llama-3-8B-OpenOrca
Text Generation
•
8B
•
Updated
•
14
after-exams/deepseek-math-7b-rl-orpo-merged
Text Generation
•
7B
•
Updated
•
8
FINGU-AI/Qwen-Orpo-v1
Text Generation
•
0.5B
•
Updated
•
11
euiyulsong/Mistral-7B-PC
Text Generation
•
7B
•
Updated
•
6
euiyulsong/Mistral-7B-PC_Minus
Text Generation
•
7B
•
Updated
•
7
euiyulsong/Mistral-7B-PC_Brier
Text Generation
•
7B
•
Updated
•
7
euiyulsong/Mistral-7B-ORPO-6kpre
Text Generation
•
7B
•
Updated
•
7
euiyulsong/Mistral-7B-ORPO-ARC-All
Text Generation
•
7B
•
Updated
•
8
euiyulsong/Mistral-7B-ORPO-Sync-1k
Text Generation
•
7B
•
Updated
•
5
euiyulsong/Mistral-7B-ORPO-Semi-Sync-1k
Text Generation
•
7B
•
Updated
•
7
euiyulsong/Mistral-7B-ORPO-Semi-task_domain_20k
Text Generation
•
7B
•
Updated
•
9
euiyulsong/Mistral-7B-ORPO-sft-sync-task_domain_20k
Text Generation
•
7B
•
Updated
•
7
euiyulsong/Mistral-7B-ORPO-sft-synth-500
Text Generation
•
7B
•
Updated
•
6
euiyulsong/Mistral-7B-SFT-synth1k-taskdomain
Text Generation
•
7B
•
Updated
•
8
euiyulsong/ORPO-synth1k-20kdomaintask-semi
Text Generation
•
7B
•
Updated
•
4
euiyulsong/ORPO-synth3k-semi
Text Generation
•
7B
•
Updated
•
8
euiyulsong/ORPO-task-domain-20k-synth3k-semi
Text Generation
•
7B
•
Updated
•
7
statking/zephyr-7b-sft-full-orpo
Text Generation
•
7B
•
Updated
•
17
baconnier/Gaston_Yi-1.5-9B-Chat
Text Generation
•
9B
•
Updated
•
10
baconnier/Gaston_dolphin-2.9.1-yi-1.5-9b
9B
•
Updated
•
2
baconnier/Notaires_dolphin-2.9.1-yi-1.5-9b
Text Generation
•
9B
•
Updated
•
8
mradermacher/Gaston_dolphin-2.9.1-yi-1.5-9b-GGUF
9B
•
Updated
•
219
mradermacher/Notaires_Yi-1.5-9B-Chat-GGUF
9B
•
Updated
•
174
mradermacher/Notaires_dolphin-2.9.1-yi-1.5-9b-GGUF
9B
•
Updated
•
198
mradermacher/Gaston_Yi-1.5-9B-Chat-GGUF
9B
•
Updated
•
221
Magneto/lora_16bit_orpo
Text Generation
•
Updated
•
5
apps90/OrpoGPT2
Text Generation
•
0.1B
•
Updated
•
6
statking/Meta-Llama-3-70B-Instruct
Updated
•
11
statking/Meta-Llama-3-8B-Instruct-ORPO-QLoRA
Updated
•
15