A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
AI & ML interests
None defined yet.
Recent Activity
Papers
View all Papers datasets 7
JavisVerse/AV-FineTune
Viewer
• Updated
• 1.43M • 71
JavisVerse/JavisUnd-Eval
Updated
• 95
JavisVerse/MM-PreTrain
Viewer
• Updated
• 340k • 120
JavisVerse/JavisInst-Omni
Viewer
• Updated
• 91.4k • 231 • 1
JavisVerse/JavisBench
Viewer
• Updated
• 22.3k • 52
JavisVerse/JavisData-Audio
Viewer
• Updated
• 788k • 32
JavisVerse/TAVGBench_clean
Viewer
• Updated
• 1.58M • 9