Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints

Commit History

Update README.md
60cf460
verified

Zhangchen Xu commited on

Update README.md
0b30eab
verified

Zhangchen Xu commited on

Update README.md
f75750c
verified

Zhangchen Xu commited on

End of training
268dd21
verified

Zhangchen Xu commited on

Model save
d52283b
verified

Zhangchen Xu commited on

Training in progress, step 1531
e0f4dd7
verified

Zhangchen Xu commited on

Training in progress, step 1500
72f60d1
verified

Zhangchen Xu commited on

Training in progress, step 1000
c53d548
verified

Zhangchen Xu commited on

Training in progress, step 500
8656d23
verified

Zhangchen Xu commited on

initial commit
c99f097
verified

Zhangchen Xu commited on