diff --git a/README.md b/README.md index b22a049..8342378 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ -# Trust Region-Guided Proximal Policy Optimization +# Truly Proximal Policy Optimization Source code for the paper: [Truly Proximal Policy Optmization](https://arxiv.org/abs/1903.07940). The original code was forked from [OpenAI baselines](https://github.com/openai/baselines).