MMMU

non-profit

https://mmmu-benchmark.github.io/

MMMU-Benchmark

Activity Feed Request to join this org

AI & ML interests

Multimodal Model Evaluation

Recent Activity

wren93 authored a paper 2 days ago

Scaling Zero-Shot Reference-to-Video Generation

wren93 authored a paper 2 days ago

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

wren93 authored a paper 2 days ago

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

View all activity

wren93

authored 3 papers 2 days ago

Scaling Zero-Shot Reference-to-Video Generation

Paper • 2512.06905 • Published Dec 7, 2025 • 28

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Paper • 2512.07802 • Published Dec 8, 2025 • 44

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Paper • 2512.21338 • Published 29 days ago • 21

zhangysk

authored 6 papers 10 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 43

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published 13 days ago • 48

yuanshengni

in MMMU/MMMU 10 days ago

wrong_use，need deleted

#6 opened 17 days ago by

Aros199

yuexiang96

authored 4 papers about 1 month ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 28

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3, 2025 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 37

zhangysk

submitted a paper to Daily Papers about 1 month ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 43

wren93

authored a paper about 2 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 72

zhangysk

authored a paper about 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 293

zhangysk

authored 3 papers 2 months ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published Nov 5, 2025 • 7

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Paper • 2511.04285 • Published Nov 6, 2025 • 7

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published Nov 10, 2025 • 17

AI & ML interests

Recent Activity

Team members 17

MMMU's activity

wrong_use，need deleted