voc2vec is a foundation model specifically designed for non-verbal human data.
Alkis Koudounas
alkiskoudounas
AI & ML interests
Speech and Audio Processing | Multimodal Understanding | Trustworthy and Responsible AI
Organizations
models 56
alkiskoudounas/mt5-xl-mugec-lora-10best
Updated • 4
alkiskoudounas/mt5-xl-mugec-lora-3best
Updated
alkiskoudounas/xls-r-128-italic-noisy
Audio Classification • Updated • 8
alkiskoudounas/xls-r-128-italic-massive
Audio Classification • Updated • 7
alkiskoudounas/xls-r-128-italic-speaker
Audio Classification • 0.3B • Updated • 11
alkiskoudounas/xls-r-53-it-italic-speaker
Audio Classification • Updated • 7
alkiskoudounas/xls-r-53-fr-speechmassive-fr-FR-gold
Audio Classification • 0.3B • Updated • 6
alkiskoudounas/xls-r-128-speechmassive-fr-FR-gold
Audio Classification • 0.3B • Updated • 5
alkiskoudounas/xls-r-53-de-speechmassive-de-DE-gold
Audio Classification • 0.3B • Updated • 7
alkiskoudounas/xls-r-128-speechmassive-de-DE-gold
Audio Classification • 0.3B • Updated • 12
datasets 0
None public yet