umangkaushik

ubermenchh

AI & ML interests

None yet

Recent Activity

updated a model about 19 hours ago
ubermenchh/llama3.1-8B-gsm8k-grpo
liked a dataset about 20 hours ago
open-r1/OpenR1-Math-Raw
published a model 1 day ago
ubermenchh/llama3.1-8B-gsm8k-grpo

Organizations

Social Post Explorers

ubermenchh's activity

upvoted an article 7 days ago
Article

The N Implementation Details of RLHF with PPO

• 37
upvoted an article 7 months ago
Article

The Annotated Diffusion Model

• 143