hazyresearch
/

Weaver_Distilled_All_Datasets_gte-Qwen2-1.5B-instruct

Text Classification

Model card Files Files and versions

jonsaadfalcon commited on Jun 12

Commit

ba2e06b

·

verified ·

1 Parent(s): 54b0d94

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,6 @@
 # Weaver Distilled - All Datasets (gte-Qwen2-1.5B-instruct)
 This is a distilled cross-encoder model based on [Alibaba-NLP/gte-Qwen2-1.5B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct), trained to predict the correctness of answers across multiple domains: [MATH500](https://huggingface.co/datasets/HuggingFaceH4/MATH-500), [GPQA](https://huggingface.co/datasets/Idavidrein/gpqa), and [MMLU Pro](https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro). This general-purpose verifier was trained on Weaver scores aggregated over 35 different verifiers and reward models.

+---
+license: mit
+---
 # Weaver Distilled - All Datasets (gte-Qwen2-1.5B-instruct)
 This is a distilled cross-encoder model based on [Alibaba-NLP/gte-Qwen2-1.5B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct), trained to predict the correctness of answers across multiple domains: [MATH500](https://huggingface.co/datasets/HuggingFaceH4/MATH-500), [GPQA](https://huggingface.co/datasets/Idavidrein/gpqa), and [MMLU Pro](https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro). This general-purpose verifier was trained on Weaver scores aggregated over 35 different verifiers and reward models.