HWERI
/

llama2-exams-orca-sharegpt

Text Generation

text-generation-inference

Model card Files Files and versions

CaterinaLac commited on Oct 23, 2023

Commit

0d6c33d

·

1 Parent(s): fee6c22

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -12,4 +12,6 @@ language:
 - fr
 ---
-This model is a Llama2-7B model finetuned on the union of ShareGPT, the exams dataset and Orca.

 - fr
 ---
+This model is a Llama2-7B model finetuned on the union of ShareGPT, the exams dataset and a subset of the Orca dataset.
+The finetuning was performed with [DeepSpeed Chat](https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat) toolkit (step 1, sft).
+The model run for three epochs before reaching a plateau on the validation dataset. We used a cosine scheduler, with an initial LR of 2e-5.