cloud19
/

NSFW_MMaudio

Not-For-All-Audiences

Model card Files Files and versions

would it be easy to train v2 instead?

#3

by phazei - opened Oct 11, 2025

phazei

Oct 11, 2025

you trained mmaudio v1. unfortunately that one only outputs 5s fixed. v2 is able to create dynamic lengths. v1 has issues the authors address which is why they came out with v2 right away.

cloud19

Owner Oct 14, 2025

Yeah, at the time I trained it, only v1 had the scripts for retraining available. The author had mentioned that the tools for v2 weren't in the repo yet. I'll check and see if that's changed now.

baba123

Oct 30, 2025

any update on this? thanks for your hard work

Nov 8, 2025

This comment has been hidden

Nov 8, 2025

A v2 version would be amazing!

Nov 8, 2025

I am also waiting for version 2, as well as information about trigger words! ❤️

cloud19

Owner Nov 11, 2025

I see the demand, and I will continue my work soon

cloud19

Owner Nov 16, 2025

The developer of MMaudio explicitly states that training for v2 is not supported https://github.com/hkchengrex/MMAudio/blob/main/docs/TRAINING.md
This was the case previously, and nothing has changed. I will look for other solutions

cloud19

Owner Nov 16, 2025

I plan to work with ThinkSound

cloud19

Owner Nov 16, 2025

I am compiling a dataset for ~40 hours. With a wide variety. It will definitely be ThinkSound

6 days ago

Could you share your dataset?

6 days ago

Did you get anywhere with ThinkSound ?

baba123

6 days ago

Also curious if ThinkSound worked out 👍

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment