Spaces:
Runtime error
Runtime error
| title: SonicVerse | |
| emoji: πΌ | |
| colorFrom: purple | |
| colorTo: red | |
| sdk: gradio | |
| sdk_version: 5.25.2 | |
| app_file: app.py | |
| pinned: false | |
| # πΌΒ SonicVerse | |
| An interactive demo for SonicVerse, a music captioning model, allowing users to input audio of up to 10 seconds and generate a natural language caption | |
| that includes a general description of the music as well as music features such as key, instruments, genre, mood / theme, vocals gender. | |
| --- | |
| ## π Demo | |
| Check out the live Space here: | |
| [](https://huggingface.co/spaces/amaai-lab/SonicVerse) | |
| --- | |
| ## π Samples | |
| Short captions | |
| --- | |
| ## π¦ Features | |
| β Upload a 10 second music clip and get a caption | |
| β Upload a long music clip (upto 1 minute for successful demo) to get a long detailed caption for the whole music clip. | |
| --- | |
| ## π οΈ How to Run Locally | |
| ```bash | |
| # Clone the repo | |
| git clone https://github.com/AMAAI-Lab/SonicVerse | |
| cd SonicVerse | |
| # Install dependencies | |
| pip install -r requirements.txt | |
| # Alternatively, set up conda environment | |
| conda env create -f environment.yml | |
| conda activate sonicverse | |
| # Run the app | |
| python app.py | |
| ``` | |
| --- | |
| <!-- ## π File Structure | |
| ``` | |
| . | |
| βββ app.py # Web app file | |
| βββ requirements.txt # Python dependencies | |
| βββ environment.yml # Conda environment | |
| βββ README.md # This file | |
| βββ src/sonicverse # Source | |
| ``` | |
| --- --> | |
| ## π‘ Usage | |
| To use the app: | |
| 1. Select audio clip to input | |
| 2. Click the **Generate** button. | |
| 3. See the modelβs output below. | |
| --- | |
| ## π§Ή Built With | |
| - [Hugging Face Spaces](https://huggingface.co/spaces) | |
| - [Gradio](https://gradio.app/) | |
| - [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1) | |
| - [MERT 95M](https://huggingface.co/m-a-p/MERT-v1-95M) | |
| --- | |
| <!-- ## β¨ Acknowledgements | |
| - [Model authors or papers you built on] | |
| - [Contributors or collaborators] | |
| --- | |
| ## π License | |
| This project is licensed under the MIT License / Apache 2.0 / Other. | |
| --> | |