Hugging Face
RLHFlow/Qwen2.5-7B-DPO
Safetensors
qwen2
arxiv:2405.07863
2 contributors
History: 1 commit
HanningZhang: initial commit (b77082a, verified), 6 months ago
.gitattributes (1.52 kB, Safe): initial commit, 6 months ago