3 18 8

WANG Jiong

wjwow

wangjiongw

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Geometrically-Constrained Agent for Spatial Reasoning

upvoted a paper about 1 month ago

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

upvoted a paper about 1 month ago

Tongyi DeepResearch Technical Report

View all activity

Organizations

None yet

upvoted a paper 6 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 10 days ago • 38

upvoted 2 papers about 1 month ago

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper • 2510.25590 • Published Oct 29 • 27

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96

upvoted 2 papers about 2 months ago

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published Oct 14 • 48

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 114

upvoted a paper 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 136

upvoted a paper 3 months ago

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Paper • 2509.10441 • Published Sep 12 • 30

upvoted 3 papers 5 months ago

updated a dataset about 1 year ago

wjwow/FreeMan

Viewer • Updated Nov 5, 2024 • 895 • 45 • 8

upvoted a paper about 1 year ago

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Paper • 2410.13925 • Published Oct 17, 2024 • 24

updated a collection over 1 year ago

MLLM

Collection

8 items • Updated Jul 16, 2024

upvoted a paper over 1 year ago

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 41

updated a collection over 1 year ago

MLLM

Collection

8 items • Updated Jul 16, 2024

liked a dataset over 1 year ago

xcodemind/webcode2m

Viewer • Updated Mar 5 • 3.17M • 40.5k • 43

upvoted a paper over 1 year ago

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Paper • 2403.11703 • Published Mar 18, 2024 • 17

upvoted 2 papers almost 2 years ago

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29, 2024 • 55

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26, 2024 • 37

updated a collection almost 2 years ago

MLLM

Collection

8 items • Updated Jul 16, 2024

WANG Jiong

AI & ML interests

Recent Activity

Organizations

wjwow's activity