Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
clembench-playpen
/
meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue
like
0
Follow
clembench-project-playpen
17
Transformers
Safetensors
Generated from Trainer
unsloth
trl
kto
arxiv:
2402.01306
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue
Commit History
End of training
4f5c4c0
verified
mazzaqq
commited on
Feb 5
initial commit
df615f7
verified
mazzaqq
commited on
Feb 4