ppo Agent playing BipedalWalkerHardcore-v3

This is a trained model of a ppo agent playing BipedalWalkerHardcore-v3 using the stable-baselines3 library.

Downloads last month
3
Video Preview
loading

Evaluation results

  • mean_reward on BipedalWalkerHardcore-v3
    self-reported
    8.23 +/- 83.71