-
Attention Is All You Need
Paper • 1706.03762 • Published • 115 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 19 -
LLaMA: Open and Efficient Foundation Language Models
Paper • 2302.13971 • Published • 21 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 250
Collections
Discover the best community collections!
Collections including paper arxiv:2409.12186
-
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper • 2411.04905 • Published • 127 -
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Paper • 2405.04324 • Published • 25 -
Seed-Coder: Let the Code Model Curate Data for Itself
Paper • 2506.03524 • Published • 6 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 153
-
Qwen Technical Report
Paper • 2309.16609 • Published • 38 -
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper • 2311.07919 • Published • 10 -
Qwen2 Technical Report
Paper • 2407.10671 • Published • 168 -
Qwen2-Audio Technical Report
Paper • 2407.10759 • Published • 64
-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 47 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28
-
Attention Is All You Need
Paper • 1706.03762 • Published • 115 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 19 -
LLaMA: Open and Efficient Foundation Language Models
Paper • 2302.13971 • Published • 21 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 250
-
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper • 2411.04905 • Published • 127 -
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Paper • 2405.04324 • Published • 25 -
Seed-Coder: Let the Code Model Curate Data for Itself
Paper • 2506.03524 • Published • 6 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 153
-
Qwen Technical Report
Paper • 2309.16609 • Published • 38 -
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper • 2311.07919 • Published • 10 -
Qwen2 Technical Report
Paper • 2407.10671 • Published • 168 -
Qwen2-Audio Technical Report
Paper • 2407.10759 • Published • 64
-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 47 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28