ppo Agent playing BipedalWalkerHardcore-v3
This is a trained model of a ppo agent playing BipedalWalkerHardcore-v3 using the stable-baselines3 library.
- Downloads last month
- 3
Evaluation results
- mean_reward on BipedalWalkerHardcore-v3self-reported8.23 +/- 83.71