RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies Paper • 2510.17950 • Published Oct 20 • 7
Running on Zero MCP Featured 306 ThinkSound 🔊 306 Generate audio for a video using captions and descriptions
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 55
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings Paper • 2305.10786 • Published May 18, 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation Paper • 2312.11825 • Published Dec 19, 2023
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published Jan 10 • 52