FunAudioLLM

company

Activity Feed

AI & ML interests

None defined yet.

xianbao

authored a paper about 1 month ago

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

Paper • 2510.17950 • Published Oct 20 • 7

aluminumbox

updated a model 4 months ago

FunAudioLLM/CosyVoice-300M-Instruct

Updated Aug 11 • 1

aluminumbox

published a model 4 months ago

FunAudioLLM/CosyVoice-300M-Instruct

Updated Aug 11 • 1

aluminumbox

updated a model 4 months ago

FunAudioLLM/CosyVoice-300M-SFT

Updated Aug 1

aluminumbox

published a model 4 months ago

FunAudioLLM/CosyVoice-300M-SFT

Updated Aug 1

aluminumbox

updated 3 models 4 months ago

aluminumbox

published 3 models 4 months ago

FunAudioLLM/CosyVoice-ttsfrd

Updated Jul 29 • 1

FunAudioLLM/CosyVoice-300M

Updated Jul 31 • 1

FunAudioLLM/CosyVoice2-0.5B

Updated Jul 31 • 30

liuhuadai

updated a Space 4 months ago

ThinkSound

🔊

306

Generate audio for a video using captions and descriptions

liuhuadai

in FunAudioLLM/ThinkSound 5 months ago

feat: Enable MCP

#4 opened 5 months ago by multimodalart

liuhuadai

in FunAudioLLM/ThinkSound 5 months ago

VAE License

👀 1

#3 opened 5 months ago by Fauno15

liuhuadai

updated a model 5 months ago

FunAudioLLM/ThinkSound

Video-to-Video • Updated Jul 17 • 48

liuhuadai

in FunAudioLLM/ThinkSound 5 months ago

Thankyou FunAdioLLM Team!

#2 opened 5 months ago by Narutoouz

wenmengzhou

authored a paper 8 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26 • 55

iris2c

authored 3 papers 9 months ago

Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Paper • 2305.10786 • Published May 18, 2023

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

Paper • 2312.11825 • Published Dec 19, 2023

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 52

Team members 14