wxzhang/dpo-selective-bufferdata
Text Generation
Transformers
mistral
conversational
36f7a2e
dpo-selective-bufferdata/tokenizer.model
Commit History
Training in progress, step 500
9c3dc80
verified
wxzhang
committed on
Apr 3, 2024