Whisper Models Dutch Language Collection This repo contains Dutch Whisper models finetuned on CV and other synthetic data, with different filtering options • 11 items • Updated Sep 16, 2025 • 2
Whisper Models Portuguese Language Collection This Repo contains Whisper models trained on subsets of data like Common Voice 17(CV_17), Synthetic(Generated by OpenAI) + CV17 and Synthetic Only. • 13 items • Updated 2 days ago • 2
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 21
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 14