Models

21

Full-text search

Active filters: audio-visual

elix3r/LTX-2.3-22b-AV-LoRA-talking-head

Image-to-Video • Updated Mar 24 • 8.56k • 54

Memories-ai/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Oct 5, 2025 • 27 • 2

bpiyush/sound-of-water-models

Audio Classification • Updated Jan 13, 2025 • 3

bolinlai/CSTS

Updated Mar 18, 2025 • 5

openinterx/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Jul 19, 2025 • 11 • 4

JusperLee/Dolphin

Audio-to-Audio • 7.04M • Updated Apr 13 • 11.5k • 13

matbee/sam-audio-small-onnx

Updated Dec 24, 2025 • 9

matbee/sam-audio-large-onnx

Updated Dec 23, 2025 • 8

square-zero-labs/sam-audio-small-onnx

lopho/ltx2-artist-loras

Updated Apr 2 • 3

dnamodel/tsam-viewer-emotions

Video Classification • Updated Mar 27 • 2

oonepieceeyewear/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Apr 15 • 2

ckoutlis/auvire-lavdf

Updated Apr 21 • 5

ckoutlis/auvire-avdeepfake1m

Updated Apr 21 • 3

Vegetabot/AVSQwen-Omni-7B

Image-Text-to-Text • 11B • Updated Apr 24 • 5

Vegetabot/AVSQwen-Omni-3B

Image-Text-to-Text • 6B • Updated Apr 24 • 4

vsro200/models-vsro200

Video-Text-to-Text • Updated 21 days ago

mhussainahmad/averformer-ravdess

Updated 14 days ago • 24

mhussainahmad/averformer-cremad-v4

Updated 6 days ago • 41

mhussainahmad/averformer-ravdess-v4

Updated 6 days ago • 81

mhussainahmad/averformer-meld-v4

Updated 6 days ago • 53