RLHF-And-Friends/tldr-ppo-TLDR-Mistral-7B-Base-CoPPO-completions Viewer • Updated 10 days ago • 100 • 49
RLHF-And-Friends/tldr-ppo-TLDR-Mistral-7B-SmallSFT-CoPPO-completions Viewer • Updated 10 days ago • 100 • 49