Kopachelli's Collections General
Paper
• 2505.09388
• Published
• 338
Text Generation
• 15B • Updated
• 102k
• 73
Text Generation
• 8B • Updated
• 74.3k
• 161
Text Generation
• 4B • Updated
• 44.1k
• 86
Qwen2.5-Coder Technical Report
Paper
• 2409.12186
• Published
• 153
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation
• 8B • Updated
• 2.1M
• 666
Text Generation
• 15B • Updated
• 25.1k
• 68
Qwen/Qwen2.5-Coder-14B-Instruct
Text Generation
• 15B • Updated
• 513k
• 143
Text Generation
• 8B • Updated
• 205k
• 139
DeepSeek-V3 Technical Report
Paper
• 2412.19437
• Published
• 78
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Paper
• 2406.11931
• Published
• 69
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
• Updated
• 309k
• 219
Llama-Nemotron: Efficient Reasoning Models
Paper
• 2505.00949
• Published
• 41
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
Paper
• 2504.16891
• Published
• 25
OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique
Paper
• 2507.09075
• Published
• 18
tencent/Hunyuan-7B-Instruct
Text Generation
• Updated
• 6.73k
• 87
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper
• 2507.01006
• Published
• 251
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
• 10B • Updated
• 368k
• 773
Image-Text-to-Text
• 10B • Updated
• 2.19k
• 65
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
Paper
• 2407.01906
• Published
• 46
deepseek-ai/deepseek-moe-16b-base
Text Generation
• 16B • Updated
• 24.6k
• 140
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper
• 2401.06066
• Published
• 59
deepseek-ai/deepseek-moe-16b-chat
Text Generation
• 16B • Updated
• 21.3k
• 155
Skywork/Skywork-VL-Reward-7B
Image-Text-to-Text
• 8B • Updated
• 1.61k
• 47
Mungert/xLAM-2-32b-fc-r-GGUF
Text Generation
• 33B • Updated
• 332
• 5
• 8B • Updated
• 58
• 6
Mungert/Skywork-VL-Reward-7B-GGUF
Image-Text-to-Text
• 8B • Updated
• 102
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B
Text Classification
• Updated
• 6.34k
• 33
jnorthrup/Skywork-o1-Open-PRM-Qwen-2.5-7B
Text Classification
• 8B • Updated
mistralai/Mixtral-8x7B-Instruct-v0.1
• 47B • Updated
• 693k
• 4.65k
Mungert/granite-guardian-3.1-8b-GGUF
Text Generation
• 8B • Updated
• 26
ariels/pest_twitter_geoparsing
Viewer
• Updated
• 678 • 9