Pinxin Liu PRO

pliu23

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 14 days ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

Paper • 2511.17490 • Published 18 days ago • 21

upvoted a paper about 2 months ago

Directional Reasoning Injection for Fine-Tuning MLLMs

Paper • 2510.15050 • Published Oct 16 • 11

upvoted a paper 2 months ago

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6 • 48

updated a dataset 3 months ago

pliu23/BEATv2

Updated Sep 2 • 17

published a dataset 3 months ago

pliu23/BEATv2

Updated Sep 2 • 17

liked a dataset 6 months ago

yunlong10/MMPerspective

Viewer • Updated Jun 13 • 5.08k • 302 • 7

New activity in Qwen/Qwen2.5-VL-32B-Instruct-AWQ 6 months ago

AssertionError: Both operands must be same dtype. Got fp16 and bf16

#8 opened 8 months ago by treehugg3

liked a Space 6 months ago

Perceptual Copilot

👁

Interact with an AI assistant using your camera and chat

upvoted a paper 7 months ago

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Paper • 2505.20426 • Published May 26 • 7

updated a model 8 months ago

pliu23/GestureLSM

Updated Apr 16 • 1

upvoted 2 papers 8 months ago

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)

Paper • 2504.03151 • Published Apr 4 • 15

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published Apr 2 • 18

updated a dataset 8 months ago

pliu23/BEAT-2-Render

Updated Mar 29 • 12

published a dataset 8 months ago

pliu23/BEAT-2-Render

Updated Mar 29 • 12

authored 6 papers 9 months ago

GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling

Paper • 2501.18898 • Published Jan 31