Update README.md
README.md
CHANGED
@@ -34,7 +34,7 @@ If you want to contact us & join us, you can ✉️ to our team : <opendilab@p
 | Algo.\Env. | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [BipedalWalker](https://di-engine-docs.readthedocs.io/en/latest/13_envs/bipedalwalker.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [SpaceInvaders](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Qbert](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Hopper](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Halfcheetah](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Walker2d](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) |
 | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
 | [PPO](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-PPO) | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-PPO) | | |
-| [PPO(Offpolicy)](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-PPOOffPolicy) | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPOOffPolicy) | | | | | |
+| [PPO(Offpolicy)](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-PPOOffPolicy) | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPOOffPolicy) | | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPOOffPolicy) | | | |
 | [PG](https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf) | ⏳ | | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-PG) | | | | ⏳ | | |
 | [A2C](https://arxiv.org/pdf/1602.01783.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-A2C) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-A2C) | | | | | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-A2C) | | |
 | [IMPALA](https://arxiv.org/pdf/1802.01561.pdf) | ⏳ | | | | | | ⏳ | | |