Trinkle23897
be9ce44290
fix #59
2020-05-29 11:49:47 +08:00
Trinkle23897
de556fd22d
item3 of #51
2020-05-27 11:02:23 +08:00
Trinkle23897
0eef0ca198
fix optional type syntax
2020-05-16 20:08:32 +08:00
Trinkle23897
9b26137cd2
add type annotation
2020-05-12 11:31:47 +08:00
Trinkle23897
075825325e
add preprocess_fn ( #42 )
2020-05-05 13:39:51 +08:00
Trinkle23897
134f787e24
reserve 'policy' keyword in replay buffer
2020-04-29 17:48:48 +08:00
Trinkle23897
4fd826761c
enable null buffer in test collector
2020-04-20 11:50:18 +08:00
Trinkle23897
7b65d43394
vanilla imitation learning
2020-04-13 19:37:27 +08:00
Trinkle23897
74407e13da
env info log_fn ( #28 )
2020-04-10 18:02:05 +08:00
Trinkle23897
13086b7f64
add ignore_obs_next in buffer
2020-04-10 09:01:17 +08:00
Trinkle23897
6da80e045a
fix rnn ( #19 ), add __repr__, and fix #26
2020-04-09 19:53:45 +08:00
Trinkle23897
86572c66d4
maybe finished rnn?
2020-04-08 21:13:15 +08:00
Trinkle23897
e0809ff135
add policy docs ( #21 )
2020-04-06 19:36:59 +08:00
Trinkle23897
610390c132
add docs of collector and trainer ( #20 )
2020-04-05 18:34:45 +08:00
Oblivion
4d4d0daf9e
Performance improve ( #18 )
...
* improve performance
set one thread for NN
replace detach() op with torch.no_grad()
* fix pep 8 errors
2020-04-05 09:10:21 +08:00
Trinkle23897
f23b0dfac9
add ListReplayBuffer
2020-03-28 15:14:41 +08:00
Trinkle23897
c42990c725
add rllib result and fix pep8
2020-03-28 09:43:35 +08:00
Minghao Zhang
77068af526
add examples, fix some bugs ( #5 )
...
* update atari.py
* fix setup.py
pass the pytest
* fix setup.py
pass the pytest
* add args "render"
* change the tensorboard writter
* change the tensorboard writter
* change device, render, tensorboard log location
* change device, render, tensorboard log location
* remove some wrong local files
* fix some tab mistakes and the envs name in continuous/test_xx.py
* add examples and point robot maze environment
* fix some bugs during testing examples
* add dqn network and fix some args
* change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally
* add a warning to collector
* rm some unrelated files
* reformat
* fix a bug in test_dqn due to the model wrong selection
2020-03-28 07:27:18 +08:00
Trinkle23897
44f911bc31
add pytorch drl result
2020-03-27 09:04:29 +08:00
Minghao Zhang
3c0a09fefd
minor reformat ( #2 )
...
* update atari.py
* fix setup.py
pass the pytest
* fix setup.py
pass the pytest
2020-03-26 09:01:20 +08:00
Trinkle23897
fdc969b830
fix collector
2020-03-25 14:08:28 +08:00
Trinkle23897
75364cd986
ppo and early stop
2020-03-20 19:52:29 +08:00
Trinkle23897
c87fe3c18c
add trainer
2020-03-19 17:23:46 +08:00
Trinkle23897
64bab0b6a0
ddpg
2020-03-18 21:45:41 +08:00
Trinkle23897
fd621971e5
fix bug in test
2020-03-17 15:16:30 +08:00
Trinkle23897
8b0b970c9b
add speed stat
2020-03-16 15:04:58 +08:00
Trinkle23897
cef5de8b83
fix some bugs
2020-03-16 11:11:29 +08:00
Trinkle23897
5983c6b33d
finish dqn
2020-03-15 17:41:00 +08:00
Trinkle23897
c804662457
add cache buf in collector
2020-03-14 21:48:31 +08:00
Trinkle23897
543e57cdbd
clear
2020-03-13 21:47:17 +08:00
Trinkle23897
f16e05c0e7
maybe finished collector?
2020-03-13 17:49:22 +08:00
Trinkle23897
f58c1397c6
half of collector
2020-03-12 22:20:33 +08:00