Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 4 days ago

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

upvoted a paper 4 days ago

Self-Distilled Agentic Reinforcement Learning

upvoted a paper 6 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Organizations

Collections 3

Papers 20

models 6

datasets 0

None public yet

Collections 3

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

models 6

demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40

demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240

demolei/Qwen2.5-1.5B-Open-R1-Distill

demolei/Qwen-2.5-7B-Simple-RL

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

demolei/sft_openassistant-guanaco
