zjn

garlicisnotmyfavor

AI & ML interests

Multimodal LLMs and video generation

Recent Activity

upvoted a paper about 2 months ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

commented on a paper 2 months ago

PairUni: Pairwise Training for Unified Multimodal Language Models

upvoted a paper 3 months ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Organizations

None yet

authored a paper 6 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10, 2025 • 49

authored a paper 10 months ago

VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model

Paper • 2502.18906 • Published Feb 26, 2025 • 12

authored 2 papers about 2 years ago

What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?

Paper • 2307.02469 • Published Jul 5, 2023 • 12

Make Pixels Dance: High-Dynamic Video Generation

Paper • 2311.10982 • Published Nov 18, 2023 • 68