SWE-RL
solving GitHub issues with Agentless scaffold and RL
rasdani/deepseek_r1_qwen14b_swe_rl_8k • 15B • Updated Jul 12 • 9
rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs • 8B • Updated Jul 10 • 6
rasdani/SkyRL-v0-293-data-oracle-8k-context • Viewer • Updated Jul 11 • 145 • 25
smolR1
reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani/smolR1-Qwen2.5-0.5B • Text Generation • 0.5B • Updated Mar 31 • 10
rasdani/simplerl_qwen_level1to4 • Viewer • Updated Mar 29 • 8.14k • 11