Investigating Regularization of Self-Play Language Models Paper • 2404.04291 • Published Apr 4, 2024
Robustness and risk management via distributional dynamic programming Paper • 2112.15430 • Published Dec 28, 2021
Beyond Log-Concavity: Theory and Algorithm for Sum-Log-Concave Optimization Paper • 2309.15298 • Published Sep 26, 2023