Can Visual Input Be Compressed? A Visual Token Compression Benchmark for Large Multimodal Models Paper • 2511.02650 • Published Nov 4 • 9
DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents Paper • 2510.19336 • Published Oct 22 • 16
AndesVL Collection AndesVL is a suite of mobile-optimized Multimodal Large Language Models (MLLMs) with 0.6B to 4B parameters. • 8 items • Updated Oct 15 • 11
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model Paper • 2510.11496 • Published Oct 13 • 3
Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18 • 50
SmolLM3 pretraining datasets Collection Datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12 • 39
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 75
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published Jun 18 • 39
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning Paper • 2505.10557 • Published May 15 • 47
GenX: Mastering Code and Test Generation with Execution Feedback Paper • 2412.13464 • Published Dec 18, 2024 • 1
Improved Visual-Spatial Reasoning via R1-Zero-Like Training Paper • 2504.00883 • Published Apr 1 • 66
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 429
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 7 days ago • 81