Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RLHFlow
/
Qwen2.5-7B-DPO
like
0
Follow
RLHFlow
108
Safetensors
qwen2
arxiv:
2405.07863
Model card
Files
Files and versions
Community
main
Qwen2.5-7B-DPO
/
README.md
Commit History
Update README.md
e0f1fbd
verified
Chenlu123
commited on
26 days ago
Update README.md
4c75597
verified
Chenlu123
commited on
26 days ago
Update README.md
0de054e
verified
Chenlu123
commited on
26 days ago
Upload Qwen2ForCausalLM
5c1e7fb
verified
HanningZhang
commited on
26 days ago