Changes: - Disclaimer in README - Replaced all occurences of Gym with Gymnasium - Removed code that is now dead since we no longer need to support the old step API - Updated type hints to only allow new step API - Increased required version of envpool to support Gymnasium - Increased required version of PettingZoo to support Gymnasium - Updated `PettingZooEnv` to only use the new step API, removed hack to also support old API - I had to add some `# type: ignore` comments, due to new type hinting in Gymnasium. I'm not that familiar with type hinting but I believe that the issue is on the Gymnasium side and we are looking into it. - Had to update `MyTestEnv` to support `options` kwarg - Skip NNI tests because they still use OpenAI Gym - Also allow `PettingZooEnv` in vector environment - Updated doc page about ReplayBuffer to also talk about terminated and truncated flags. Still need to do: - Update the Jupyter notebooks in docs - Check the entire code base for more dead code (from compatibility stuff) - Check the reset functions of all environments/wrappers in code base to make sure they use the `options` kwarg - Someone might want to check test_env_finite.py - Is it okay to allow `PettingZooEnv` in vector environments? Might need to update docs?
169 lines
1.2 KiB
Plaintext
169 lines
1.2 KiB
Plaintext
tianshou
|
|
arXiv
|
|
tanh
|
|
lr
|
|
logits
|
|
env
|
|
envs
|
|
optim
|
|
eps
|
|
timelimit
|
|
TimeLimit
|
|
envpool
|
|
EnvPool
|
|
maxsize
|
|
timestep
|
|
timesteps
|
|
numpy
|
|
ndarray
|
|
stackoverflow
|
|
tensorboard
|
|
state_dict
|
|
len
|
|
tac
|
|
fqf
|
|
iqn
|
|
qrdqn
|
|
rl
|
|
offpolicy
|
|
onpolicy
|
|
quantile
|
|
quantiles
|
|
dqn
|
|
param
|
|
async
|
|
subprocess
|
|
deque
|
|
nn
|
|
equ
|
|
cql
|
|
fn
|
|
boolean
|
|
pre
|
|
np
|
|
cuda
|
|
rnn
|
|
rew
|
|
pre
|
|
perceptron
|
|
bsz
|
|
dataset
|
|
mujoco
|
|
jit
|
|
nstep
|
|
preprocess
|
|
preprocessing
|
|
repo
|
|
ReLU
|
|
namespace
|
|
recv
|
|
th
|
|
utils
|
|
NaN
|
|
linesearch
|
|
hyperparameters
|
|
pseudocode
|
|
entropies
|
|
nn
|
|
config
|
|
cpu
|
|
rms
|
|
debias
|
|
indice
|
|
regularizer
|
|
miniblock
|
|
modularize
|
|
serializable
|
|
softmax
|
|
vectorized
|
|
optimizers
|
|
undiscounted
|
|
submodule
|
|
subclasses
|
|
submodules
|
|
tfevent
|
|
dirichlet
|
|
docstring
|
|
webpage
|
|
formatter
|
|
num
|
|
py
|
|
pythonic
|
|
中文文档位于
|
|
conda
|
|
miniconda
|
|
Amir
|
|
Andreas
|
|
Antonoglou
|
|
Beattie
|
|
Bellemare
|
|
Charles
|
|
Daan
|
|
Demis
|
|
Dharshan
|
|
Fidjeland
|
|
Georg
|
|
Hassabis
|
|
Helen
|
|
Ioannis
|
|
Kavukcuoglu
|
|
King
|
|
Koray
|
|
Kumaran
|
|
Legg
|
|
Mnih
|
|
Ostrovski
|
|
Petersen
|
|
Riedmiller
|
|
Rusu
|
|
Sadik
|
|
Shane
|
|
Stig
|
|
Veness
|
|
Volodymyr
|
|
Wierstra
|
|
Lillicrap
|
|
Pritzel
|
|
Heess
|
|
Erez
|
|
Yuval
|
|
Tassa
|
|
Schulman
|
|
Filip
|
|
Wolski
|
|
Prafulla
|
|
Dhariwal
|
|
Radford
|
|
Oleg
|
|
Klimov
|
|
Kaichao
|
|
Jiayi
|
|
Weng
|
|
Duburcq
|
|
Huayu
|
|
Yi
|
|
Su
|
|
Strens
|
|
Ornstein
|
|
Uhlenbeck
|
|
mse
|
|
gail
|
|
airl
|
|
ppo
|
|
Jupyter
|
|
Colab
|
|
Colaboratory
|
|
IPendulum
|
|
Reacher
|
|
Runtime
|
|
Nvidia
|
|
Enduro
|
|
Qbert
|
|
Seaquest
|
|
subnets
|
|
subprocesses
|
|
isort
|
|
yapf
|
|
pydocstyle
|
|
Args
|