Wang Chengyao

wcy1122

https://wcy1122.github.io/

AI & ML interests

Multimodal Intelligence

Recent Activity

updated a dataset 6 days ago

wcy1122/D1

published a dataset 7 days ago

wcy1122/D1

liked a model 7 days ago

deepseek-ai/DeepSeek-V3.2-Speciale

Organizations

upvoted 2 papers about 1 month ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6 • 36

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 174

upvoted 2 papers about 2 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

DreamOmni2: Multimodal Instruction-based Editing and Generation

Paper • 2510.06679 • Published Oct 8 • 73

upvoted 2 papers 2 months ago

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

Paper • 2509.25131 • Published Sep 29 • 15

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

upvoted 2 collections 4 months ago

DeepSeek-V3.1

Collection

4 items • Updated 11 days ago • 254

MGM-Omni

Collection

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech • 18 items • Updated Oct 11 • 10

upvoted 2 papers 5 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17 • 77

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

upvoted a paper 12 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 48

upvoted a paper about 1 year ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 118

upvoted 3 collections over 1 year ago

upvoted a paper over 1 year ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 47
