---
title: README
emoji: 👀
colorFrom: yellow
colorTo: indigo
sdk: static
pinned: false
---
The first version of LLaMA-O1 has been uploaded to Hugging Face. Here it comes!
Supervised:
https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129
Base (Pretrained):
https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset:
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset:
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202
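
The repository IDs in the links above can be passed straight to the Hugging Face `transformers` and `datasets` libraries. The following is a minimal sketch, not an official usage guide: it assumes the checkpoints load as standard causal language models through `AutoModelForCausalLM` and that the datasets load with their default configuration. The helper names `load_model` and `load_data` are hypothetical and exist only for this illustration.

```python
# Repository IDs copied from the links above.
MODEL_REPOS = {
    "supervised": "SimpleBerry/LLaMA-O1-Supervised-1129",
    "base": "SimpleBerry/LLaMA-O1-Base-1127",
}
DATASET_REPOS = {
    "sft": "SimpleBerry/OpenLongCoT-SFT",
    "pretrain": "SimpleBerry/OpenLongCoT-Pretrain-1202",
}


def load_model(variant="supervised"):
    """Hypothetical helper: download and return (tokenizer, model).

    Assumption: the checkpoint is a standard causal-LM checkpoint.
    """
    # Imported lazily so the ID tables above are usable without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    repo_id = MODEL_REPOS[variant]
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return tokenizer, model


def load_data(name="sft"):
    """Hypothetical helper: download one of the datasets listed above.

    Assumption: the dataset loads with its default configuration.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_REPOS[name])
```

Calling `load_model("base")` would fetch the pretrained checkpoint instead of the supervised one. Both helpers download from the Hub on first use, so they need network access and enough disk space for the weights.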
RLHF is on the way! View our GitHub Repo:
https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research:
https://huggingface.co/papers/2406.07394
https://huggingface.co/papers/2410.02884
https://huggingface.co/papers/2411.18203
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64bce15bafd1e46c5504ad38/4Mi1LVWPx8z4wOlfNgl2e.png)