inference-optimization/Llama-3.2-1B-Instruct-5-bits-mode-heuristic-per-tensor 1B • Updated about 17 hours ago • 16
inference-optimization/Llama-3.2-1B-Instruct-5-bits-mode-hybrid-per-tensor 1B • Updated about 17 hours ago • 10
inference-optimization/Llama-3.2-1B-Instruct-5-bits-mode-noise-per-tensor 1B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-heuristic-per-tensor 1B • Updated about 17 hours ago • 10
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-hybrid-per-tensor 1B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-noise-per-tensor 1B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-heuristic-per-tensor 1B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-hybrid-per-tensor 1B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-noise-per-tensor 1B • Updated about 17 hours ago • 14
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-heuristic-per-tensor 1B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-hybrid-per-tensor 1B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-noise-per-tensor 1B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-heuristic-per-tensor 1B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-hybrid-per-tensor 1B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-noise-per-tensor 1B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-heuristic-per-tensor 3B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-hybrid-per-tensor 3B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-noise-per-tensor 3B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-heuristic-per-tensor 3B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-hybrid-per-tensor 3B • Updated about 17 hours ago • 4
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-noise-per-tensor 3B • Updated about 17 hours ago • 11
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-heuristic-per-tensor 3B • Updated about 17 hours ago • 14
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-hybrid-per-tensor 3B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-noise-per-tensor 3B • Updated about 17 hours ago • 14
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-heuristic-per-tensor 3B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-hybrid-per-tensor 3B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-noise-per-tensor 3B • Updated about 17 hours ago • 5
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-heuristic-per-tensor 3B • Updated about 17 hours ago • 12
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-hybrid-per-tensor 3B • Updated about 17 hours ago • 4
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-noise-per-tensor 3B • Updated about 17 hours ago • 13
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-heuristic-per-tensor 5B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-hybrid-per-tensor 5B • Updated about 17 hours ago • 7
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-noise-per-tensor 5B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-heuristic-per-tensor 6B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-hybrid-per-tensor 6B • Updated about 17 hours ago • 11
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-noise-per-tensor 6B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-heuristic-per-tensor 6B • Updated about 17 hours ago • 6
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-hybrid-per-tensor 6B • Updated about 17 hours ago • 5
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-noise-per-tensor 6B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-heuristic-per-tensor 7B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-hybrid-per-tensor 7B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-noise-per-tensor 7B • Updated about 17 hours ago • 8
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-heuristic-per-tensor 7B • Updated about 17 hours ago • 15
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-hybrid-per-tensor 7B • Updated about 17 hours ago • 9
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-noise-per-tensor 7B • Updated about 17 hours ago • 9
inference-optimization/Qwen3-8B-5.5-bits-mode-heuristic-per-tensor 6B • Updated about 17 hours ago • 9
inference-optimization/Qwen3-8B-6-bits-mode-heuristic-per-tensor 6B • Updated about 17 hours ago • 11
inference-optimization/Qwen3-8B-6.5-bits-mode-heuristic-per-tensor 7B • Updated about 17 hours ago • 7
inference-optimization/Qwen3-30B-A3B-5-bits-mode-heuristic-per-tensor 19B • Updated about 17 hours ago • 5
inference-optimization/Qwen3-30B-A3B-5-bits-mode-hybrid-per-tensor 19B • Updated about 17 hours ago • 6
inference-optimization/Qwen3-30B-A3B-5-bits-mode-noise-per-tensor 19B • Updated about 17 hours ago • 8
inference-optimization/Qwen3-30B-A3B-5.5-bits-mode-heuristic-per-tensor 21B • Updated about 17 hours ago • 6
inference-optimization/Qwen3-30B-A3B-5.5-bits-mode-hybrid-per-tensor 21B • Updated about 17 hours ago • 6
inference-optimization/Qwen3-30B-A3B-5.5-bits-mode-noise-per-tensor 21B • Updated about 17 hours ago • 8
inference-optimization/Qwen3-30B-A3B-6-bits-mode-heuristic-per-tensor 23B • Updated about 17 hours ago • 8
inference-optimization/Qwen3-30B-A3B-6-bits-mode-hybrid-per-tensor 23B • Updated about 17 hours ago • 12
inference-optimization/Qwen3-30B-A3B-6-bits-mode-noise-per-tensor 23B • Updated about 17 hours ago • 8
inference-optimization/Qwen3-30B-A3B-6.5-bits-mode-heuristic-per-tensor 25B • Updated about 17 hours ago • 13
inference-optimization/Qwen3-30B-A3B-6.5-bits-mode-hybrid-per-tensor 25B • Updated about 17 hours ago • 4
inference-optimization/Qwen3-30B-A3B-6.5-bits-mode-noise-per-tensor 25B • Updated about 17 hours ago • 7
inference-optimization/Qwen3-30B-A3B-7-bits-mode-heuristic-per-tensor 27B • Updated about 17 hours ago • 6
inference-optimization/Qwen3-30B-A3B-7-bits-mode-hybrid-per-tensor 27B • Updated about 17 hours ago • 11
inference-optimization/Qwen3-30B-A3B-7-bits-mode-noise-per-tensor 27B • Updated about 17 hours ago • 6