3 1 93

Casimiro Ferreira

Jarbas

https://tigregotico.pt

AI & ML interests

None yet

Recent Activity

liked a model about 16 hours ago

fdemelo/ovos-hierarchical-knn-granite-97m-multilingual-r2

liked a model 6 days ago

yuriyvnv/WAVe-1B-Multimodal-NL

reacted to yuriyvnv's post with 🔥 6 days ago

📄 The WAVe paper is officially out in the Information Sciences Journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation. Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data. 📦 Resources - Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220 - PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT - NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL - Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering - Code: https://github.com/yuriyvnv/WAVe If you train ASR on synthetic or back-translated data, would like to see WAVe benchmarked on other languages. @reach-vb @ylacombe @hf-audio @BramVanroy #speech #asr #multimodal #syntheticdata #lowresource

View all activity

Organizations

liked a model about 16 hours ago

fdemelo/ovos-hierarchical-knn-granite-97m-multilingual-r2

Updated about 12 hours ago • 1

liked a model 6 days ago

yuriyvnv/WAVe-1B-Multimodal-NL

Audio Classification • 0.9B • Updated Feb 14 • 14 • 2

liked a dataset 6 days ago

apptek-com/apptek_callcenter_dialogues

Viewer • Updated 3 days ago • 1.75k • 3.27k • 27

liked 2 datasets 8 days ago

OwnedByDanes/Usenet-Corpus-1980-2013

Viewer • Updated 7 days ago • 65k • 376 • 14

BSC-LT/distilled-catalan-youtube-speech

Updated 8 days ago • 70 • 1

liked a dataset 21 days ago

proxectonos/wikipedia_multiple_choice_qa

Viewer • Updated 21 days ago • 2.03k • 102 • 1

liked a model 23 days ago

yuriyvnv/Qwen3-ASR-1.7B-PT

Automatic Speech Recognition • 2B • Updated 21 days ago • 168 • 1

liked a model about 1 month ago

Sourajit123/SouraTTS

Text-to-Speech • Updated Mar 16 • 3

liked 2 datasets about 1 month ago

allenai/ai2_arc

Viewer • Updated Dec 21, 2023 • 7.79k • 432k • 337

malaysia-ai/Multilingual-TTS

Viewer • Updated Mar 9 • 62.7M • 4.3k • 19

liked 3 datasets about 2 months ago

liked a model 2 months ago

Tabahi/CUPE-2i

Audio Classification • Updated Jan 12 • 7

liked 4 models 3 months ago

BSC-LT/mRoBERTa

Fill-Mask • 0.3B • Updated Aug 7, 2025 • 270 • 7

BSC-LT/MrBERT

Fill-Mask • Updated Mar 26 • 635 • 8

BSC-LT/MrBERT-es

Fill-Mask • 0.2B • Updated Apr 9 • 6.25k • • 7

BSC-LT/MrBERT-ca

Fill-Mask • Updated 21 days ago • 62 • 2

liked a dataset 3 months ago

BSC-LT/Catalan-Aranese_Parallel_Corpus

Viewer • Updated Feb 6 • 539k • 26 • 1

liked a model 3 months ago

marksverdhei/Qwen3-Voice-Embedding-12Hz-0.6B-onnx

Feature Extraction • Updated Feb 23 • 21

Casimiro Ferreira

AI & ML interests

Recent Activity

Organizations

Jarbas's activity