Active filters: audio-visual
Memories-ai/UGC-VideoCaptioner
Video-Text-to-Text • 6B • Updated • 27 • 2
bpiyush/sound-of-water-models
Audio Classification • Updated • 3
bolinlai/CSTS
Updated • 5
openinterx/UGC-VideoCaptioner
Video-Text-to-Text • 6B • Updated • 11 • 4
JusperLee/Dolphin
Audio-to-Audio • 7.04M • Updated • 11.5k • 13
matbee/sam-audio-small-onnx
Updated • 9
matbee/sam-audio-large-onnx
Updated • 8
square-zero-labs/sam-audio-small-onnx
Updated
lopho/ltx2-artist-loras
Updated • 3
dnamodel/tsam-viewer-emotions
Video Classification • Updated • 2
oonepieceeyewear/UGC-VideoCaptioner
Video-Text-to-Text • 6B • Updated • 2
ckoutlis/auvire-lavdf
Updated • 5
ckoutlis/auvire-avdeepfake1m
Updated • 3
Vegetabot/AVSQwen-Omni-7B
Image-Text-to-Text • 11B • Updated • 5
Vegetabot/AVSQwen-Omni-3B
Image-Text-to-Text • 6B • Updated • 4
vsro200/models-vsro200
Video-Text-to-Text • Updated
mhussainahmad/averformer-ravdess
Updated • 24
mhussainahmad/averformer-cremad-v4
Updated • 41
mhussainahmad/averformer-ravdess-v4
Updated • 81
mhussainahmad/averformer-meld-v4
Updated • 53