Guofeng Yi

YShow

AI & ML interests

LLMs & LMMs; AI agents; on-device LMs

Recent Activity

Organizations

None yet

YShow's activity

upvoted an article 7 days ago
Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By NormalUhr