Spaces: vishaljoshi24/trl-4-dnd
e2368f1
trl-4-dnd/trl/scripts
1 contributor, History: 1 commit
vishaljoshi24, Initial Commit, a080fe0, 21 days ago
File            Scan   Size      Last commit      Updated
__init__.py     Safe   1 kB      Initial Commit   21 days ago
dpo.py          Safe   5.31 kB   Initial Commit   21 days ago
env.py          Safe   3.68 kB   Initial Commit   21 days ago
grpo.py         Safe   5.28 kB   Initial Commit   21 days ago
kto.py          Safe   4.19 kB   Initial Commit   21 days ago
sft.py          Safe   5.39 kB   Initial Commit   21 days ago
utils.py        Safe   11.3 kB   Initial Commit   21 days ago
vllm_serve.py   Safe   29 kB     Initial Commit   21 days ago