EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 3 days ago • 21
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published 6 days ago • 43
Thinking with Programming Vision: Towards a Unified View for Thinking with Images Paper • 2512.03746 • Published 5 days ago • 15
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 6 days ago • 28
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 6 days ago • 175
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published 18 days ago • 91
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 25 days ago • 93
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 26 days ago • 194
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6 • 208
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30 • 81
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Paper • 2510.26802 • Published Oct 30 • 33