JackyChunKit/BDIreward_data3000_4e-7_BS64_ro16_len1400_relen512_trainon_Qwen3_cot93_eCeLLM-M_450 7B • Updated Jun 1, 2025
JackyChunKit/BDIreward_data3000_4e-7_BS64_ro16_len1400_relen2048_trainon_Qwen3_cot186_eCeLLM-M_50 7B • Updated Jun 1, 2025
JackyChunKit/BDIreward_data3000_4e-7_BS64_ro16_len1400_relen512_trainon_Qwen3_cot93_eCeLLM-M_400 7B • Updated Jun 1, 2025
JackyChunKit/BDIreward_data3000shuffle_mistral_7b_GRPO_4e_7_BS64_len1400_relen1024_trainon_eCeLLM-Mcot161_300 7B • Updated Jun 1, 2025
JackyChunKit/BDIreward_data3000shuffle_mistral_7b_GRPO_4e_7_BS64_len1400_relen1024_trainon_eCeLLM-Mcot161_250 7B • Updated Jun 1, 2025
JackyChunKit/eCeLLMfilter_data3000shuffle_GRPO_4e_7_BS364_ro16_len1800_relen1024_trainon_eCeLLM_50 7B • Updated Jun 1, 2025 • 3
JackyChunKit/sft_lr1e-5_epochs10_Qwen3_cot_Qwen2.5-14B-Instruct_global_step_1309 15B • Updated Jun 1, 2025
JackyChunKit/sft_lr1e-5_epochs10_Qwen3_cot_Qwen2.5-14B-Instruct_global_step_187 15B • Updated Jun 1, 2025 • 1
JackyChunKit/Nothink_qwen25_7b_GROP_ep10_bs32_lr4e-7_len600_trainonSFT5454_global_step_200 8B • Updated Jun 1, 2025
JackyChunKit/qwen25_7b_Instruct_GROP_ep10_bs32_lr4e-7_len512_step150 Text Generation • 7B • Updated Apr 27, 2025
JackyChunKit/qwen25_7b_Instruct_GROP_ep10_bs32_lr4e-7_len512_step100 Text Generation • 7B • Updated Apr 27, 2025
JackyChunKit/qwen25_7b_Instruct_GROP_ep10_bs32_lr4e-7_len512_step50 Text Generation • 7B • Updated Apr 27, 2025