CHARM: Calibrating Reward Models With Chatbot Arena Scores Paper • 2504.10045 • Published Apr 14, 2025
XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation Paper • 2503.22973 • Published Mar 29, 2025
DocHPLT: A Massively Multilingual Document-Level Translation Dataset Paper • 2508.13079 • Published Aug 18, 2025 • 1
MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization Paper • 2510.05962 • Published Oct 7, 2025
HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models Paper • 2511.01066 • Published Nov 2, 2025 • 3
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Paper • 2606.06428 • Published Jun 4 • 25
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters Paper • 2507.13618 • Published Jul 18, 2025 • 17
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published Aug 20, 2025 • 86
A Controllable Examination for Long-Context Language Models Paper • 2506.02921 • Published Jun 3, 2025 • 34
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025 • 2
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025 • 2
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025 • 2
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025 • 2
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
The Highs and Lows of Simple Lexical Domain Adaptation Approaches for Neural Machine Translation Paper • 2101.00421 • Published Jan 2, 2021
To Adapt or to Fine-tune: A Case Study on Abstractive Summarization Paper • 2208.14559 • Published Aug 30, 2022
A Unified Model for Reverse Dictionary and Definition Modelling Paper • 2205.04602 • Published May 9, 2022
Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation Task Paper • 2008.05348 • Published Aug 12, 2020
The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT) Paper • 2210.11309 • Published Oct 20, 2022