From 672294af5f91361b79dc463f67373f091514b723 Mon Sep 17 00:00:00 2001 From: wangyuhuix Date: Fri, 17 Jan 2020 12:31:37 +0800 Subject: [PATCH] small error --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 746edef..d4f8bd4 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ # Trust Region-Guided Proximal Policy Optimization -Source code for the paper: [Truly Proximal Policy Optmization (TRGPPO)](https://arxiv.org/abs/1901.10314). The original code was forked from [OpenAI baselines](https://github.com/openai/baselines). +Source code for the paper: [Truly Proximal Policy Optmization](https://arxiv.org/abs/1903.07940). The original code was forked from [OpenAI baselines](https://github.com/openai/baselines). Method is tested on [MuJoCo](http://www.mujoco.org/) continuous control tasks and [Atari](https://www.atari.com/) discrete game tasks in [OpenAI gym](https://github.com/openai/gym). Networks are trained using [tensorflow1.10](https://www.tensorflow.org/) and Python 3.6.