Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_result_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 43
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_python_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 11
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_print_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 18
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_output_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 22
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_assistant_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 28
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_Final_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 17 hours ago • 20
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_begin_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 18 hours ago • 13
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_array_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 18 hours ago • 13
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_Certainly_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 18 hours ago • 20
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_Alright_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 18 hours ago • 20
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated Dec 4, 2025 • 5.59k • 7