Miakat's picture

Miakat

mimipynb
·

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago
Anthropic/hh-rlhf
updated a collection 1 day ago
InstructModelDataset
liked a dataset 1 day ago
SoftAge-AI/rlhf-qa_dataset
View all activity

Organizations

None yet

mimipynb's activity

upvoted an article 3 months ago
view article
Article

Preference Tuning LLMs with Direct Preference Optimization Methods

55
upvoted an article 8 months ago
view article
Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

228
upvoted an article 12 months ago
view article
Article

The Technology Behind BLOOM Training

30