Anas BELFADIL | 53e6b0408d | Add BranchingDQN for large discrete action spaces (#618) | 2022-05-15 21:40:32 +08:00 |
Alex Nikulkov | 92456cdb68 | Add learning rate scheduler to BasePolicy (#598) | 2022-04-17 23:52:30 +08:00 |
ChenDRAG | c25926dd8f | Formalize variable names (#509). Co-authored-by: Jiayi Weng <trinkle23897@gmail.com> | 2022-01-30 00:53:56 +08:00 |
Yi Su | a59d96d041 | Add Intrinsic Curiosity Module (#503) | 2022-01-15 02:43:48 +08:00 |
n+e | fc251ab0b8 | bump to v0.4.3 (#432): add makefile; bump version; add isort and yapf; update contributing.md; update PR template; spelling check | 2021-09-03 05:05:04 +08:00 |
ChenDRAG | dd4a01132c | Fix SAC loss explode (#333): change SAC action_bound_method to "clip" (tanh is hardcoded in forward); docstring update; modelbase -> modelbased | 2021-04-04 17:33:35 +08:00 |