# Changes ## Dependencies - New extra "eval" ## Api Extension - `Experiment` and `ExperimentConfig` now have a `name`, that can however be overridden when `Experiment.run()` is called - When building an `Experiment` from an `ExperimentConfig`, the user has the option to add info about seeds to the name. - New method in `ExperimentConfig` called `build_default_seeded_experiments` - `SamplingConfig` has an explicit training seed, `test_seed` is inferred. - New `evaluation` package for repeating the same experiment with multiple seeds and aggregating the results (important extension!). Currently in alpha state. - Loggers can now restore the logged data into python by using the new `restore_logged_data` ## Breaking Changes - `AtariEnvFactory` (in examples) now receives explicit train and test seeds - `EnvFactoryRegistered` now requires an explicit `test_seed` - `BaseLogger.prepare_dict_for_logging` is now abstract --------- Co-authored-by: Maximilian Huettenrauch <m.huettenrauch@appliedai.de> Co-authored-by: Michael Panchenko <m.panchenko@appliedai.de> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com>
269 lines
2.0 KiB
Plaintext
269 lines
2.0 KiB
Plaintext
tianshou
|
|
arXiv
|
|
tanh
|
|
lr
|
|
logits
|
|
env
|
|
envs
|
|
optim
|
|
eps
|
|
timelimit
|
|
TimeLimit
|
|
envpool
|
|
EnvPool
|
|
maxsize
|
|
timestep
|
|
timesteps
|
|
numpy
|
|
ndarray
|
|
stackoverflow
|
|
tensorboard
|
|
state_dict
|
|
len
|
|
tac
|
|
fqf
|
|
iqn
|
|
qrdqn
|
|
rl
|
|
offpolicy
|
|
onpolicy
|
|
quantile
|
|
quantiles
|
|
dqn
|
|
param
|
|
async
|
|
subprocess
|
|
deque
|
|
nn
|
|
equ
|
|
cql
|
|
fn
|
|
boolean
|
|
pre
|
|
np
|
|
cuda
|
|
rnn
|
|
rew
|
|
pre
|
|
perceptron
|
|
bsz
|
|
dataset
|
|
mujoco
|
|
jit
|
|
nstep
|
|
preprocess
|
|
preprocessing
|
|
repo
|
|
ReLU
|
|
namespace
|
|
recv
|
|
th
|
|
utils
|
|
NaN
|
|
linesearch
|
|
hyperparameters
|
|
pseudocode
|
|
entropies
|
|
nn
|
|
config
|
|
cpu
|
|
rms
|
|
debias
|
|
indice
|
|
regularizer
|
|
miniblock
|
|
modularize
|
|
serializable
|
|
softmax
|
|
vectorized
|
|
optimizers
|
|
undiscounted
|
|
submodule
|
|
subclasses
|
|
submodules
|
|
tfevent
|
|
dirichlet
|
|
docstring
|
|
webpage
|
|
formatter
|
|
num
|
|
py
|
|
pythonic
|
|
中文文档位于
|
|
conda
|
|
miniconda
|
|
Amir
|
|
Andreas
|
|
Antonoglou
|
|
Beattie
|
|
Bellemare
|
|
Charles
|
|
Daan
|
|
Demis
|
|
Dharshan
|
|
Fidjeland
|
|
Georg
|
|
Hassabis
|
|
Helen
|
|
Ioannis
|
|
Kavukcuoglu
|
|
King
|
|
Koray
|
|
Kumaran
|
|
Legg
|
|
Mnih
|
|
Ostrovski
|
|
Petersen
|
|
Riedmiller
|
|
Rusu
|
|
Sadik
|
|
Shane
|
|
Stig
|
|
Veness
|
|
Volodymyr
|
|
Wierstra
|
|
Lillicrap
|
|
Pritzel
|
|
Heess
|
|
Erez
|
|
Yuval
|
|
Tassa
|
|
Schulman
|
|
Filip
|
|
Wolski
|
|
Prafulla
|
|
Dhariwal
|
|
Radford
|
|
Oleg
|
|
Klimov
|
|
Kaichao
|
|
Jiayi
|
|
Weng
|
|
Duburcq
|
|
Huayu
|
|
Yi
|
|
Su
|
|
Strens
|
|
Ornstein
|
|
Uhlenbeck
|
|
mse
|
|
gail
|
|
airl
|
|
ppo
|
|
Jupyter
|
|
Colab
|
|
Colaboratory
|
|
IPendulum
|
|
Reacher
|
|
Runtime
|
|
Nvidia
|
|
Enduro
|
|
Qbert
|
|
Seaquest
|
|
subnets
|
|
subprocesses
|
|
isort
|
|
yapf
|
|
pydocstyle
|
|
Args
|
|
tuples
|
|
tuple
|
|
Multi
|
|
multi
|
|
parameterized
|
|
Proximal
|
|
metadata
|
|
GPU
|
|
Dopamine
|
|
builtin
|
|
params
|
|
inplace
|
|
deepcopy
|
|
Gaussian
|
|
stdout
|
|
parallelization
|
|
minibatch
|
|
minibatches
|
|
MLP
|
|
backpropagation
|
|
dataclass
|
|
superset
|
|
subtype
|
|
subdirectory
|
|
picklable
|
|
ShmemVectorEnv
|
|
Github
|
|
wandb
|
|
jupyter
|
|
img
|
|
src
|
|
parallelized
|
|
infty
|
|
venv
|
|
venvs
|
|
subproc
|
|
bcq
|
|
highlevel
|
|
icm
|
|
modelbased
|
|
td
|
|
psrl
|
|
ddpg
|
|
npg
|
|
tf
|
|
trpo
|
|
crr
|
|
pettingzoo
|
|
multidiscrete
|
|
vecbuf
|
|
prio
|
|
colab
|
|
segtree
|
|
multiagent
|
|
mapolicy
|
|
sensai
|
|
sensAI
|
|
docstrings
|
|
superclass
|
|
iterable
|
|
functools
|
|
str
|
|
sklearn
|
|
attr
|
|
bc
|
|
redq
|
|
modelfree
|
|
bdq
|
|
util
|
|
logp
|
|
autogenerated
|
|
subpackage
|
|
subpackages
|
|
recurse
|
|
rollout
|
|
rollouts
|
|
prepend
|
|
prepends
|
|
dict
|
|
dicts
|
|
pytorch
|
|
tensordict
|
|
onwards
|
|
Dominik
|
|
Tsinghua
|
|
Tianshou
|
|
appliedAI
|
|
macOS
|
|
joblib
|
|
master
|
|
Panchenko
|
|
BA
|
|
BH
|
|
BO
|
|
BD
|
|
configs
|
|
postfix
|
|
backend
|
|
rliable
|
|
hl
|