Casimiro Ferreira
Jarbas
11 followers · 48 following
https://tigregotico.pt
JarbasAl
casimiro-ferreira-953783151
AI & ML interests
None yet
Recent Activity
liked
a model
about 16 hours ago
fdemelo/ovos-hierarchical-knn-granite-97m-multilingual-r2
liked
a model
6 days ago
yuriyvnv/WAVe-1B-Multimodal-NL
reacted to yuriyvnv's post with 🔥
6 days ago
📄 The WAVe paper is officially out in the Information Sciences Journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation.

Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data.

📦 Resources
- Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220
- PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT
- NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL
- Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering
- Code: https://github.com/yuriyvnv/WAVe

If you train ASR on synthetic or back-translated data, I would like to see WAVe benchmarked on other languages.

@reach-vb @ylacombe @hf-audio @BramVanroy #speech #asr #multimodal #syntheticdata #lowresource
Organizations
Jarbas's activity
liked
a model
about 16 hours ago
fdemelo/ovos-hierarchical-knn-granite-97m-multilingual-r2
Updated
about 12 hours ago
•
1
liked
a model
6 days ago
yuriyvnv/WAVe-1B-Multimodal-NL
Audio Classification
•
0.9B
•
Updated
Feb 14
•
14
•
2
liked
a dataset
6 days ago
apptek-com/apptek_callcenter_dialogues
Viewer
•
Updated
3 days ago
•
1.75k
•
3.27k
•
27
liked
2 datasets
8 days ago
OwnedByDanes/Usenet-Corpus-1980-2013
Viewer
•
Updated
7 days ago
•
65k
•
376
•
14
BSC-LT/distilled-catalan-youtube-speech
Updated
8 days ago
•
70
•
1
liked
a dataset
21 days ago
proxectonos/wikipedia_multiple_choice_qa
Viewer
•
Updated
21 days ago
•
2.03k
•
102
•
1
liked
a model
23 days ago
yuriyvnv/Qwen3-ASR-1.7B-PT
Automatic Speech Recognition
•
2B
•
Updated
21 days ago
•
168
•
1
liked
a model
about 1 month ago
Sourajit123/SouraTTS
Text-to-Speech
•
Updated
Mar 16
•
3
liked
2 datasets
about 1 month ago
allenai/ai2_arc
Viewer
•
Updated
Dec 21, 2023
•
7.79k
•
432k
•
337
malaysia-ai/Multilingual-TTS
Viewer
•
Updated
Mar 9
•
62.7M
•
4.3k
•
19
liked
3 datasets
about 2 months ago
acon96/Home-Assistant-Requests-V2
Viewer
•
Updated
Dec 22, 2025
•
240k
•
386
•
9
proxectonos/aya_nos
Updated
Apr 5
•
10
•
1
KickItLikeShika/NileTTS-dataset
Updated
Mar 23
•
437
•
1
liked
a model
2 months ago
Tabahi/CUPE-2i
Audio Classification
•
Updated
Jan 12
•
7
liked
4 models
3 months ago
BSC-LT/mRoBERTa
Fill-Mask
•
0.3B
•
Updated
Aug 7, 2025
•
270
•
7
BSC-LT/MrBERT
Fill-Mask
•
Updated
Mar 26
•
635
•
8
BSC-LT/MrBERT-es
Fill-Mask
•
0.2B
•
Updated
Apr 9
•
6.25k
•
7
BSC-LT/MrBERT-ca
Fill-Mask
•
Updated
21 days ago
•
62
•
2
liked
a dataset
3 months ago
BSC-LT/Catalan-Aranese_Parallel_Corpus
Viewer
•
Updated
Feb 6
•
539k
•
26
•
1
liked
a model
3 months ago
marksverdhei/Qwen3-Voice-Embedding-12Hz-0.6B-onnx
Feature Extraction
•
Updated
Feb 23
•
21