---
license: mit
---
# VIRL-VL-Init
This model serves as an initial checkpoint for reproducing the results in the paper **SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training**.
## Related links
- Website: https://tianzhechu.com/SFTvsRL/
- GitHub: https://github.com/LeslieTrue/SFTvsRL
- arXiv: https://arxiv.org/abs/2501.17161v1
- Hugging Face: https://huggingface.co/papers/2501.17161