nvidia/parakeet-tdt-0.6b-v3 Automatic Speech Recognition • 0.6B • Updated about 16 hours ago • 6.62k • 856
Running on Zero Agents Featured 1.04k Joy Caption Beta One 🖼 1.04k Generate detailed captions for any image
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 122k • 349
Running on Zero Agents 800 IndexTTS 2 Demo 🏢 800 Generate expressive speech from text and voice prompts
Runtime error Agents 216 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 216 Generate speech from text using a reference audio