Suyash Fulay

sfulay

http://suyashfulay.com

AI & ML interests

NLP, CSS

Organizations

None yet

Papers 1

arxiv:2409.05283

Models 66

sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-05

Updated Sep 3, 2024 • 4

sfulay/zephyr-7b-dpo-full-prometheus-reward-scale-1-rpo

Updated Sep 3, 2024 • 9

sfulay/zephyr-7b-dpo-full-gpt-reward-scale-05

Updated Sep 3, 2024 • 8

sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo-gamma-05

Updated Sep 3, 2024 • 8

sfulay/zephyr-7b-dpo-full-gpt-reward-scale-1-rpo

Updated Sep 3, 2024 • 5

sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo-gamma-2

Updated Sep 3, 2024 • 4

sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo

Updated Sep 2, 2024 • 12

sfulay/zephyr-7b-dpo-full-gpt-reward-scale-01

Updated Sep 2, 2024 • 8

sfulay/zephyr-7b-dpo-full-gpt-reward-scale-1

Updated Aug 29, 2024 • 7

sfulay/zephyr-7b-dpo-full-gpt-low-curriculum

Updated Aug 29, 2024 • 8

Datasets

None public yet