Tianshou

hongshaorou/Tianshou

Fork 0

Commit Graph

Author	SHA1	Message	Date
n+e	710966eda7	change API of train_fn and test_fn (#229 ) train_fn(epoch) -> train_fn(epoch, num_env_step) test_fn(epoch) -> test_fn(epoch, num_env_step)	2020-09-26 16:35:37 +08:00
n+e	380e9e911d	fix atari examples (#206 )	2020-09-06 23:05:33 +08:00
yingchengyang	5b49192a48	DQN Atari examples (#187 ) This PR aims to provide the script of Atari DQN setting: - A speedrun of PongNoFrameskip-v4 (finished, about half an hour in i7-8750 + GTX1060 with 1M environment steps) - A general script for all atari game Since we use multiple env for simulation, the result is slightly different from the original paper, but consider to be acceptable. It also adds another parameter save_only_last_obs for replay buffer in order to save the memory. Co-authored-by: Trinkle23897 <463003665@qq.com>	2020-08-30 05:48:09 +08:00

Author

SHA1

Message

Date

n+e

710966eda7

change API of train_fn and test_fn (#229 )

train_fn(epoch) -> train_fn(epoch, num_env_step)
test_fn(epoch) -> test_fn(epoch, num_env_step)

2020-09-26 16:35:37 +08:00

n+e

380e9e911d

fix atari examples (#206 )

2020-09-06 23:05:33 +08:00

yingchengyang

5b49192a48

DQN Atari examples (#187 )

This PR aims to provide the script of Atari DQN setting:
- A speedrun of PongNoFrameskip-v4 (finished, about half an hour in i7-8750 + GTX1060 with 1M environment steps)
- A general script for all atari game
Since we use multiple env for simulation, the result is slightly different from the original paper, but consider to be acceptable.

It also adds another parameter save_only_last_obs for replay buffer in order to save the memory.

Co-authored-by: Trinkle23897 <463003665@qq.com>

2020-08-30 05:48:09 +08:00

3 Commits