Hugging Face
Suyash Fulay
sfulay
http://suyashfulay.com
AI & ML interests
NLP, CSS
Organizations
None yet
Papers
1
arxiv:2409.05283
models
66
sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-05 • Updated Sep 3, 2024 • 4
sfulay/zephyr-7b-dpo-full-prometheus-reward-scale-1-rpo • Updated Sep 3, 2024 • 9
sfulay/zephyr-7b-dpo-full-gpt-reward-scale-05 • Updated Sep 3, 2024 • 8
sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo-gamma-05 • Updated Sep 3, 2024 • 8
sfulay/zephyr-7b-dpo-full-gpt-reward-scale-1-rpo • Updated Sep 3, 2024 • 5
sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo-gamma-2 • Updated Sep 3, 2024 • 4
sfulay/zephyr-7b-dpo-full-gpt_consistent-reward-scale-1-rpo • Updated Sep 2, 2024 • 12
sfulay/zephyr-7b-dpo-full-gpt-reward-scale-01 • Updated Sep 2, 2024 • 8
sfulay/zephyr-7b-dpo-full-gpt-reward-scale-1 • Updated Aug 29, 2024 • 7
sfulay/zephyr-7b-dpo-full-gpt-low-curriculum • Updated Aug 29, 2024 • 8
datasets
None public yet