Hugging Face
Yihua Zhang
NormalUhr
6 followers · 2 following
Recent Activity
Published an article about 6 hours ago: Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
Published an article 4 days ago: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Published an article 7 days ago: A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons
NormalUhr's activity
Upvoted an article 5 months ago: Optimizing your LLM in production (Sep 15, 2023) • 15