Ahmad Beirami

beirami

http://www.mit.edu/~beirami/

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

Theoretical guarantees on the best-of-n alignment policy

authored a paper 7 months ago

Safety Alignment Should Be Made More Than Just a Few Tokens Deep

authored a paper 10 months ago

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

Organizations

Papers 7

arxiv:2406.05946

arxiv:2404.12318

arxiv:2401.16656

arxiv:2401.01879

Models

None public yet

Datasets

None public yet