yueqin yin's picture

3 1 1

yueqin yin

yyqoni

·

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

DenseRewardRLHF-PPO

updated a model about 1 month ago

yyqoni/Phi-3-mini-4k-bandit-ppo-60k

upvoted a paper about 1 month ago

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

View all activity

Organizations

yyqoni's activity

upvoted a paper about 1 month ago

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

Paper • 2501.02790 • Published Jan 6 • 9