Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
MagpieLM-8B-Chat-v0.1 / config.json

Commit History

End of training
268dd21
verified

Zhangchen Xu commited on

Training in progress, step 500
8656d23
verified

Zhangchen Xu commited on