Zhaolin Gao

GitBag

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset 30 minutes ago
GitBag/1744529623
updated a dataset 30 minutes ago
GitBag/1744529624
updated a dataset 30 minutes ago
GitBag/1744529647
View all activity

Organizations

Cornell-AGI's profile picture

Articles 1

Article
6

RLHF 101: A Technical Dive into RLHF