MattBou00/llama-3-2-1b-detox_v1d-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated Aug 19 • 5
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated Aug 20 • 4
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated Aug 20 • 5
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated Aug 20 • 5
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated 1 day ago • 12
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated 1 day ago • 16
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated 1 day ago • 11
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-80 Reinforcement Learning • 1B • Updated about 1 month ago • 12
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-100 Reinforcement Learning • 1B • Updated about 1 month ago • 11
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated about 1 month ago • 5
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated about 1 month ago • 4
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated about 1 month ago • 5
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-80 Reinforcement Learning • 1B • Updated about 1 month ago • 6
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-100 Reinforcement Learning • 1B • Updated about 1 month ago • 3
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated 29 days ago • 6
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated 29 days ago • 6
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-80 Reinforcement Learning • 1B • Updated 29 days ago • 6
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-100 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated 29 days ago • 6
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-60 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-80 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-100 Reinforcement Learning • 1B • Updated 29 days ago • 7
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-20 Reinforcement Learning • 1B • Updated 29 days ago • 17
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-40 Reinforcement Learning • 1B • Updated 29 days ago • 15