droussis (Dimitris Roussis)

upvoted 2 papers 5 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 129

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 741

upvoted a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

upvoted 2 papers 7 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 20

Krikri: Advancing Open Large Language Models for Greek

Paper • 2505.13772 • Published May 19, 2025 • 6

upvoted 3 collections 8 months ago

upvoted a paper 9 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 168

upvoted a paper 10 months ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Paper • 2503.11751 • Published Mar 14, 2025 • 17

upvoted an article 10 months ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12, 2025 • 480

upvoted 2 papers 10 months ago

Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs

Paper • 2502.14561 • Published Feb 20, 2025 • 2

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

upvoted a paper 11 months ago

Weighted-Reward Preference Optimization for Implicit Model Fusion

Paper • 2412.03187 • Published Dec 4, 2024 • 12

upvoted a paper about 1 year ago

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 38

upvoted an article about 1 year ago

Releasing the largest multilingual open pretraining dataset

Article • Nov 13, 2024 • 104

upvoted a collection about 1 year ago

EuroLLM

Collection • 8 items • Updated 18 days ago • 41

upvoted 2 papers over 1 year ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 29

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

Krikri 8B

Meltemi 7B

ILSP Greek Evaluation Suite
