Related paper: "Should We Still Pretrain Encoders with Masked Language Modeling?"
AI & ML interests
NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series
Suite of Encoder models EuroBERT
- EuroBERT/EuroBERT-210m
  Fill-Mask • 0.3B • Updated • 9.1k • 78
- EuroBERT/EuroBERT-610m
  Fill-Mask • 0.8B • Updated • 2.33k • 32
- EuroBERT/EuroBERT-2.1B
  Fill-Mask • 2B • Updated • 258 • 63
- EuroBERT: Scaling Multilingual Encoders for European Languages
  Paper • 2503.05500 • Published • 80
Related paper: "Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis" (accepted at WMT 2024)
- Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
  Paper • 2409.20059 • Published • 16
- hgissbkh/ALMA-13B-LoRA
  Text Generation • 13B • Updated • 18
- hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi
  Text Generation • 13B • Updated • 12
- hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-No-Base
  Text Generation • 13B • Updated • 10
Related paper: "Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism" (accepted at TMLR 2024)