Ariel Kwiatkowski's picture

1 1 18

Ariel Kwiatkowski

RedTachyon

·

https://redtachyon.me

RedTachyon

AI & ML interests

RL, MARL, Crowd Simulation

Recent Activity

upvoted a paper about 1 month ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

authored a paper about 1 month ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

liked a dataset 2 months ago

AI-MO/NuminaMath-CoT

View all activity

Organizations

RedTachyon's activity

upvoted a paper about 1 month ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published Feb 6 • 11