2 42 21

Wujian Peng

wjpoom

https://wjpoom.github.io/

wjpoom

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

updated a model 4 days ago

ShareLab-SII/UniAR-RL

updated a model 4 days ago

ShareLab-SII/UniAR-SFT

View all activity

Organizations

upvoted a paper 4 days ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

Paper • 2606.18249 • Published 5 days ago • 14

updated 2 models 4 days ago

ShareLab-SII/UniAR-RL

Image-to-Text • 10B • Updated 4 days ago • 57

ShareLab-SII/UniAR-SFT

Image-to-Text • 10B • Updated 4 days ago • 104

published 2 models 4 days ago

ShareLab-SII/UniAR-RL

Image-to-Text • 10B • Updated 4 days ago • 57

ShareLab-SII/UniAR-SFT

Image-to-Text • 10B • Updated 4 days ago • 104

updated a collection 4 days ago

UniAR

Collection

Model checkpoints for UniAR: Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification. • 2 items • Updated 4 days ago

upvoted a paper 11 days ago

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Paper • 2606.11188 • Published 12 days ago • 26

upvoted a paper 23 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 24 days ago • 146

upvoted 2 papers about 1 month ago

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published May 12 • 68

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

upvoted 2 papers 3 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 148

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

Paper • 2603.06449 • Published Mar 6 • 6

upvoted 2 papers 8 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 62

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 48

upvoted a paper 10 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 90

updated a model about 1 year ago

wjpoom/SPEC-CLIP-ViT-B-32

Updated Jun 16, 2025 • 1

published a model about 1 year ago

wjpoom/SPEC-CLIP-ViT-B-32

Updated Jun 16, 2025 • 1

upvoted 2 papers about 1 year ago

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Paper • 2505.18600 • Published May 24, 2025 • 49

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18, 2025 • 24

Wujian Peng

AI & ML interests

Recent Activity

Organizations

wjpoom's activity