Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
weqweasdas
/
hh_rlhf_rm_open_llama_3b
like
17
Text Classification
Transformers
PyTorch
llama
text-generation-inference
Inference Endpoints
arxiv:
2304.06767
arxiv:
2306.12420
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
3f42bb4
hh_rlhf_rm_open_llama_3b
Commit History
Adding `safetensors` variant of this model
3f42bb4
verified
SFconvertbot
commited on
Nov 2, 2024
Update README.md
06bd94a
verified
weqweasdas
commited on
Feb 25, 2024
Update README.md
e3c1d3f
verified
weqweasdas
commited on
Feb 25, 2024
Update README.md
13f510a
weqweasdas
commited on
Dec 24, 2023
Update README.md
7cad203
weqweasdas
commited on
Nov 27, 2023
Update README.md
35700d2
weqweasdas
commited on
Nov 2, 2023
Update README.md
60ccec5
weqweasdas
commited on
Aug 4, 2023
Upload llama_reward.png
4baeddc
weqweasdas
commited on
Aug 4, 2023
Upload raft.png
7058ee7
weqweasdas
commited on
Jul 26, 2023
Update README.md
5015d45
weqweasdas
commited on
Jul 26, 2023
Update README.md
23cf908
weqweasdas
commited on
Jul 18, 2023
Create README.md
7ecd22c
weqweasdas
commited on
Jul 14, 2023
upload models
dcd09c3
weqweasdas
commited on
Jul 14, 2023
initial commit
9f9a138
weqweasdas
commited on
Jul 14, 2023