
Ming Chen

ChenMing-thu14

AI & ML interests

3D Human Pose Estimation

Organizations

None yet

Recent Activity

upvoted a paper about 22 hours ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published 2 days ago • 72

upvoted a paper 3 days ago

HDR Video Generation via Latent Alignment with Logarithmic Encoding

Paper • 2604.11788 • Published 10 days ago • 6

upvoted a paper 6 days ago

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published 8 days ago • 107

upvoted a paper 7 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 8 days ago • 150

upvoted 2 papers 9 days ago

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published 14 days ago • 74

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 10 days ago • 70

upvoted a paper 10 days ago

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published 13 days ago • 47

upvoted a paper 22 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 25 days ago • 144

upvoted 2 papers 24 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published 27 days ago • 53

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 27 days ago • 155

upvoted a paper 30 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published about 1 month ago • 123

upvoted 8 papers about 1 month ago

Versatile Editing of Video Content, Actions, and Dynamics without Training

Paper • 2603.17989 • Published Mar 18 • 17

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Paper • 2603.19228 • Published Mar 19 • 68

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Paper • 2603.16871 • Published Mar 17 • 60

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 153

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Paper • 2603.11647 • Published Mar 12 • 31

ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation

Paper • 2603.11421 • Published Mar 12 • 34

EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation

Paper • 2603.06014 • Published Mar 6 • 9

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

upvoted a paper about 2 months ago

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186
