reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani PRO
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
rasdani/deepseek_r1_qwen14b_swe_rl_8k_results
published
a dataset
1 day ago
rasdani/deepseek_r1_qwen14b_swe_rl_8k_results
updated
a dataset
2 days ago
rasdani/deepseek_r1_qwen14b_swe_rl_8k_56_steps_preds