Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
vedantpalit
/
orcaminirlhfmodel
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
cc-by-nc-sa-4.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
orcaminirlhfmodel
Commit History
RLHF model of ORCA
24759ef
verified
vedantpalit
commited on
Mar 9, 2024
initial commit
e2a3930
verified
vedantpalit
commited on
Mar 9, 2024