-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2602.22661
-
IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools
Paper • 2602.05555 • Published -
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations
Paper • 2410.13790 • Published -
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 124 -
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale
Paper • 2602.23361 • Published • 14
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 124 -
Fara-7B: An Efficient Agentic Model for Computer Use
Paper • 2511.19663 • Published • 15 -
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference
Paper • 2510.09665 • Published • 4 -
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 39
-
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 106 -
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 119 -
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Paper • 2512.23447 • Published • 98 -
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper • 2512.23576 • Published • 65
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 87
-
DoPE: Denoising Rotary Position Embedding
Paper • 2511.09146 • Published • 97 -
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Paper • 2511.19365 • Published • 64 -
Latent Collaboration in Multi-Agent Systems
Paper • 2511.20639 • Published • 124 -
Video Generation Models Are Good Latent Reward Models
Paper • 2511.21541 • Published • 46
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 233 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 42 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 146
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools
Paper • 2602.05555 • Published -
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations
Paper • 2410.13790 • Published -
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 124 -
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale
Paper • 2602.23361 • Published • 14
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 124 -
Fara-7B: An Efficient Agentic Model for Computer Use
Paper • 2511.19663 • Published • 15 -
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference
Paper • 2510.09665 • Published • 4 -
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 39
-
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 106 -
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 119 -
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Paper • 2512.23447 • Published • 98 -
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper • 2512.23576 • Published • 65
-
DoPE: Denoising Rotary Position Embedding
Paper • 2511.09146 • Published • 97 -
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Paper • 2511.19365 • Published • 64 -
Latent Collaboration in Multi-Agent Systems
Paper • 2511.20639 • Published • 124 -
Video Generation Models Are Good Latent Reward Models
Paper • 2511.21541 • Published • 46
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 87
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 233 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 42 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 146