license: mit | |
# VIRL-VL-Init | |
This model serves as a initial checkpoint to reproduce results in paper **SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training**. | |
## Related links | |
Website: https://tianzhechu.com/SFTvsRL/ | |
Github: https://github.com/LeslieTrue/SFTvsRL | |
Arxiv: https://arxiv.org/abs/2501.17161v1 | |
HF: https://huggingface.co/papers/2501.17161 |