zpysky1125
pyzhao
AI & ML interests
None yet
Recent Activity
upvoted a paper 15 days ago
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
liked a model 3 months ago
MiniMaxAI/MiniMax-Text-01
upvoted a paper 3 months ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Organizations
models
None public yet
datasets
None public yet