small error
This commit is contained in:
parent
8be30e9413
commit
672294af5f
@ -1,6 +1,6 @@
|
|||||||
# Trust Region-Guided Proximal Policy Optimization
|
# Trust Region-Guided Proximal Policy Optimization
|
||||||
|
|
||||||
Source code for the paper: [Truly Proximal Policy Optmization (TRGPPO)](https://arxiv.org/abs/1901.10314). The original code was forked from [OpenAI baselines](https://github.com/openai/baselines).
|
Source code for the paper: [Truly Proximal Policy Optmization](https://arxiv.org/abs/1903.07940). The original code was forked from [OpenAI baselines](https://github.com/openai/baselines).
|
||||||
|
|
||||||
Method is tested on [MuJoCo](http://www.mujoco.org/) continuous control tasks and [Atari](https://www.atari.com/) discrete game tasks in [OpenAI gym](https://github.com/openai/gym).
|
Method is tested on [MuJoCo](http://www.mujoco.org/) continuous control tasks and [Atari](https://www.atari.com/) discrete game tasks in [OpenAI gym](https://github.com/openai/gym).
|
||||||
Networks are trained using [tensorflow1.10](https://www.tensorflow.org/) and Python 3.6.
|
Networks are trained using [tensorflow1.10](https://www.tensorflow.org/) and Python 3.6.
|
||||||
|
Loading…
x
Reference in New Issue
Block a user