SWE-RL solving Github issues with Agentless scaffold and RL rasdani/deepseek_r1_qwen14b_swe_rl_8k 15B • Updated Jul 12, 2025 • 2 • 1 rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs 8B • Updated Jul 10, 2025 • 3 • 1 rasdani/SkyRL-v0-293-data-oracle-8k-context Viewer • Updated Jul 11, 2025 • 145 • 5 • 1
smolR1 reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs rasdani/smolR1-Qwen2.5-0.5B Text Generation • 0.5B • Updated Mar 31, 2025 • 2 • rasdani/simplerl_qwen_level1to4 Viewer • Updated Mar 29, 2025 • 8.14k • 12
SWE-RL solving Github issues with Agentless scaffold and RL rasdani/deepseek_r1_qwen14b_swe_rl_8k 15B • Updated Jul 12, 2025 • 2 • 1 rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs 8B • Updated Jul 10, 2025 • 3 • 1 rasdani/SkyRL-v0-293-data-oracle-8k-context Viewer • Updated Jul 11, 2025 • 145 • 5 • 1
smolR1 reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs rasdani/smolR1-Qwen2.5-0.5B Text Generation • 0.5B • Updated Mar 31, 2025 • 2 • rasdani/simplerl_qwen_level1to4 Viewer • Updated Mar 29, 2025 • 8.14k • 12
rasdani/SkyRL-v0-293-data-oracle-4k-context-100-epochs Viewer • Updated Jul 23, 2025 • 7.2k • 14
rasdani/SkyRL-v0-293-data-oracle-8k-context-100-epochs Viewer • Updated Jul 21, 2025 • 13.1k • 5
rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs_preds_100_v2 Viewer • Updated Jul 16, 2025 • 100 • 17