-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 85 -
SlimPajama-DC: Understanding Data Combinations for LLM Training
Paper • 2309.10818 • Published • 11 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 24 -
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40
Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 5 hours ago
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models upvoted a paper 1 day ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning upvoted a paper 2 days ago
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents