Kazuki1450/Qwen2.5-1.5B-Instruct_lightr1_cmpl8192_bsz96_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated 27 days ago • 119
Kazuki1450/Qwen2.5-1.5B-Instruct_lightr1_cmpl8192_bsz96_1p0_0p0_1p0_grpo_42_Qwen3-4B • Text Generation • 2B • Updated 27 days ago • 88
Kazuki1450/Qwen2.5-1.5B-Instruct_lightr1_cmpl8192_bsz96_1p0_0p0_1p0_grpo_42_Qwen3-1.7B • Text Generation • 2B • Updated 27 days ago • 94
ahme0599/Qwen_Qwen2.5-1.5B-Instruct-GRPO-vanilla_G_4 • Text Generation • 2B • Updated 14 days ago • 295
edith81/Qwen2.5-1.5B-Vietnamese-MCQ-AllSubjects-merged • Multiple Choice • 2B • Updated 18 days ago • 18
edith81/Qwen2.5-1.5B-Vietnamese-MCQ-AllSubjects-v4-merged • Multiple Choice • 2B • Updated 18 days ago • 25