AI & ML interests
None yet
Organizations
None yet
models
19
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new
Reinforcement Learning
•
2B
•
Updated
•
2
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new
Reinforcement Learning
•
2B
•
Updated
•
3
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new
Reinforcement Learning
•
2B
•
Updated
•
1
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new
Reinforcement Learning
•
2B
•
Updated
•
3
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024
Reinforcement Learning
•
2B
•
Updated
•
4
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024
Reinforcement Learning
•
2B
•
Updated
•
5
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024
Reinforcement Learning
•
2B
•
Updated
•
4
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024
Reinforcement Learning
•
2B
•
Updated
•
5
AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_294
Reinforcement Learning
•
2B
•
Updated
•
2
AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_882
Reinforcement Learning
•
2B
•
Updated
•
1