weishen

fakerbaby

AI & ML interests

NLP, alignment, LLM

Organizations

fakerbaby's activity

upvoted a collection 3 months ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 20 items • Updated Nov 25, 2024 • 29

upvoted 2 collections 5 months ago

Infinity Instruct

16 items • Updated 2 days ago • 8

DeepSeekCoder-V2

6 items • Updated Sep 5, 2024 • 92

upvoted a paper 8 months ago

Secrets of RLHF in Large Language Models Part I: PPO

Paper • 2307.04964 • Published Jul 11, 2023 • 29

upvoted 2 collections 9 months ago

MoEs papers reading list

60 items • Updated Nov 4, 2024 • 139

Model Merging

Model merging is a very popular technique in LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 227

upvoted a paper over 1 year ago

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

Paper • 2310.05199 • Published Oct 8, 2023 • 1