Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference

Commit History

Model save
d52283b
verified

Zhangchen Xu commited on

Training in progress, step 1531
e0f4dd7
verified

Zhangchen Xu commited on

Training in progress, step 1500
72f60d1
verified

Zhangchen Xu commited on

Training in progress, step 1000
c53d548
verified

Zhangchen Xu commited on

Training in progress, step 500
8656d23
verified

Zhangchen Xu commited on

initial commit
c99f097
verified

Zhangchen Xu commited on