Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
vishaljoshi24
/
trl-4-dnd
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
64d1cf6
trl-4-dnd
/
trl
/
scripts
65.2 kB
1 contributor
History:
1 commit
vishaljoshi24
Initial Commit
a080fe0
3 months ago
__init__.py
Safe
1 kB
Initial Commit
3 months ago
dpo.py
Safe
5.31 kB
Initial Commit
3 months ago
env.py
Safe
3.68 kB
Initial Commit
3 months ago
grpo.py
Safe
5.28 kB
Initial Commit
3 months ago
kto.py
Safe
4.19 kB
Initial Commit
3 months ago
sft.py
Safe
5.39 kB
Initial Commit
3 months ago
utils.py
Safe
11.3 kB
Initial Commit
3 months ago
vllm_serve.py
Safe
29 kB
Initial Commit
3 months ago