mariosirt/EleutherAI-gpt-neo-125m-detoxified-perspective • Reinforcement Learning • Updated Jun 11, 2023 • 7
Evan-Lin/Bart-RL-many-entailment-attractive-keywordmax • Reinforcement Learning • Updated Jul 13, 2023 • 2
nlp-lab-2023-seq2seq/R-best-fine-tuned-bart-base-full-ft-reward_short_sentences_and_words-2023-07-13T06-49-08 • Reinforcement Learning • Updated Aug 20, 2023 • 17 • 1
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward1 • Reinforcement Learning • Updated Jul 15, 2023 • 3
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward2 • Reinforcement Learning • Updated Jul 15, 2023 • 3
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_v2 • Reinforcement Learning • Updated Jul 15, 2023 • 35
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward5 • Reinforcement Learning • Updated Jul 16, 2023 • 3
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_v3 • Reinforcement Learning • Updated Jul 16, 2023 • 5
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_with_checkpoints • Reinforcement Learning • Updated Jul 16, 2023 • 5
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer • Reinforcement Learning • Updated Jul 22, 2023 • 5