Yihua Zhang

NormalUhr

AI & ML interests

None yet

Recent Activity

Organizations

OPTML Group @ MSU's profile picture

Articles 5

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article
18

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

models

None public yet

datasets

None public yet