arxiv:2406.05946
Ahmad Beirami
beirami
AI & ML interests
None yet
Recent Activity
authored
a paper
4 days ago
Theoretical guarantees on the best-of-n alignment policy
authored
a paper
7 months ago
Safety Alignment Should Be Made More Than Just a Few Tokens Deep
authored
a paper
10 months ago
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual
Alignment
Organizations
models
None public yet
datasets
None public yet