Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
OpenRLHF
community
https://github.com/OpenRLHF
Activity Feed
Follow
32
AI & ML interests
None defined yet.
Recent Activity
Longhui98
authored
a paper
21 days ago
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Longhui98
authored
a paper
21 days ago
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Longhui98
authored
a paper
21 days ago
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
View all activity
Team members
7
models
10
Sort: Recently updated
OpenRLHF/Llama-3-8b-rm-mixture
Updated
Nov 30, 2024
•
6.66k
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
Nov 30, 2024
•
10
•
1
OpenRLHF/Llama-3-8b-rm-700k
Updated
Nov 30, 2024
•
1.58k
•
3
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
Updated
Oct 30, 2024
•
13
•
1
OpenRLHF/Llama-3-8b-iter-dpo-179k
Text Generation
•
Updated
Jul 28, 2024
•
15
OpenRLHF/Llama-3-8b-rlhf-100k
Text Generation
•
Updated
Jun 24, 2024
•
476
•
3
OpenRLHF/Llama-3-8b-sft-mixture
Text Generation
•
Updated
Jun 14, 2024
•
13.6k
•
1
OpenRLHF/Llama-2-7b-sft-model-ocra-500k
Text Generation
•
Updated
Jun 9, 2024
•
84
OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
Jan 24, 2024
•
9
OpenRLHF/Llama-2-13b-sft-model-ocra-500k
Text Generation
•
Updated
Jan 5, 2024
•
202
•
1
datasets
4
Sort: Recently updated
OpenRLHF/prompt-collection-v0.1-dev-100k
Viewer
•
Updated
Dec 13, 2024
•
102k
•
88
OpenRLHF/preference_700K
Viewer
•
Updated
Jul 13, 2024
•
700k
•
72
•
1
OpenRLHF/prompt-collection-v0.1
Viewer
•
Updated
Jun 14, 2024
•
179k
•
2.48k
•
7
OpenRLHF/preference_dataset_mixture2_and_safe_pku
Viewer
•
Updated
Jun 14, 2024
•
555k
•
942
•
3