Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Zhangchen Xu
Training in progress, step 1531
e0f4dd7 verified