fblgit
/

juanako-7b-v1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

fblgit commited on Nov 28, 2023

Commit

3f46fb4

·

1 Parent(s): 8754056

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -14,6 +14,9 @@ license: artistic-2.0
 # juanako-7b-v1 (UNA: Uniform Neural Alignment)
 This model uses uniform neural alignment (UNA) for the DPO training phases and is a fine-tuned version of [fblgit/zephyr-lora-dpo-b1](https://huggingface.co/fblgit/zephyr-lora-dpo-b1) on the HuggingFaceH4/ultrafeedback_binarized dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.4594
 - Rewards/chosen: -1.1095
@@ -27,7 +30,7 @@ It achieves the following results on the evaluation set:
 Followed [alignment-handbook](https://github.com/huggingface/alignment-handbook) to perform DPO (Phase 2) over Zephyr-SFT model.
-**Please feel free to run more tests and commit the results. Also if you are interested to participate in [UNA's paper research or GPU sponsorship](mailto:info@fblnet.net) to support UNA research, feel free to contact.**
 Special thanks to [TheBloke](https://huggingface.co/TheBloke) for converting the model into multiple formats and overall his enormous contribution to the community.
 Here are the models:

 # juanako-7b-v1 (UNA: Uniform Neural Alignment)
 This model uses uniform neural alignment (UNA) for the DPO training phases and is a fine-tuned version of [fblgit/zephyr-lora-dpo-b1](https://huggingface.co/fblgit/zephyr-lora-dpo-b1) on the HuggingFaceH4/ultrafeedback_binarized dataset.
+**It is recommended to use the latest [Juanako Version](https://huggingface.co/fblgit/juanako-7b-UNA) which highly outperforms the v1**
 It achieves the following results on the evaluation set:
 - Loss: 0.4594
 - Rewards/chosen: -1.1095
 Followed [alignment-handbook](https://github.com/huggingface/alignment-handbook) to perform DPO (Phase 2) over Zephyr-SFT model.
+**Please feel free to run more tests and commit the results. Also if you are interested to participate in [UNA's paper research or GPU sponsorship](mailto:xavi@juanako.ai) to support UNA research, feel free to contact.**
 Special thanks to [TheBloke](https://huggingface.co/TheBloke) for converting the model into multiple formats and overall his enormous contribution to the community.
 Here are the models: