Guofeng Yi
YShow
AI & ML interests
LLMs & LMMs; AI agents; on-device LMs
Recent Activity
Organizations
None yet
YShow's activity
view article
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
upvoted a paper 3 months ago