Maximilian Huettenrauch
dffe8cddf6
fix pandas dependency
2024-03-26 14:40:08 +01:00
Maximilian Huettenrauch
5a3f2291c2
added pandas dependency
2024-03-26 14:37:43 +01:00
Maximilian Huettenrauch
85204b175c
added matplotlib dependency
2024-03-26 14:32:42 +01:00
Maximilian Huettenrauch
2e3f0b5e99
move doc string
2024-03-26 14:26:27 +01:00
Maximilian Huettenrauch
d9a201754c
updates
2024-03-26 14:23:54 +01:00
Maximilian Huettenrauch
516c956d58
Merge branch 'thuml_master' into feature/algo-eval
2024-03-25 10:32:42 +01:00
Michael Panchenko
61bf9adaff
Update CHANGELOG.md [skip ci]
2024-03-20 23:09:26 +01:00
Michael Panchenko
5f96a57bbb
Add CHANGELOG.md
2024-03-20 23:08:34 +01:00
Michael Panchenko
1a4d7deca6
Update publish.yaml, typo [skip ci[
v1.0.0
2024-03-20 00:41:46 +01:00
Michael Panchenko
72df9a580d
Update publish.yaml [skip ci]
2024-03-20 00:41:17 +01:00
Michael Panchenko
55e9bee373
Update publish.yaml [skip ci]
2024-03-20 00:39:54 +01:00
Michael Panchenko
e3661c11e3
Update publish.yaml, missing / [skip ci]
2024-03-20 00:26:11 +01:00
Maximilian Huettenrauch
5259d5f3fb
Merge branch 'thuml_master' into feature/algo-eval
...
# Conflicts:
# examples/mujoco/mujoco_env.py
2024-03-15 09:42:17 +01:00
maxhuettenrauch
e82379c47f
Allow explicit setting of multiprocessing context for SubprocEnvWorker ( #1072 )
...
Running multiple training runs in parallel (with, for example, joblib)
fails on macOS due to a change in the standard context for
multiprocessing (see
[here](https://stackoverflow.com/questions/65098398/why-using-fork-works-but-using-spawn-fails-in-python3-8-multiprocessing )
or
[here](https://www.reddit.com/r/learnpython/comments/g5372v/multiprocessing_with_fork_on_macos/ )).
This PR adds the ability to explicitly set a multiprocessing context for
the SubProcEnvWorker (similar to gymnasium's
[AsyncVecEnv](https://github.com/Farama-Foundation/Gymnasium/blob/main/gymnasium/vector/async_vector_env.py )).
---------
Co-authored-by: Maximilian Huettenrauch <m.huettenrauch@appliedai.de>
Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com>
2024-03-14 11:07:56 +01:00
Maximilian Huettenrauch
a7898b15b8
small fix
2024-03-12 15:17:33 +01:00
Maximilian Huettenrauch
d9a612a997
format, type check and small fixes
2024-03-12 15:01:50 +01:00
Maximilian Huettenrauch
f730782f29
Merge branch 'thuml_master' into feature/algo-eval
2024-03-12 11:46:08 +01:00
Maximilian Huettenrauch
6c1bd85521
add mujoco example with multiple runs and performance plots
2024-03-12 11:44:48 +01:00
Maximilian Huettenrauch
5762d2c2e0
extend hl experiment builder
2024-03-12 11:43:52 +01:00
Maximilian Huettenrauch
734119ec00
logger updates
2024-03-12 11:31:41 +01:00
Maximilian Huettenrauch
32cd3b4357
logger updates
...
- introduced logger manager
- loggers can reload logged data from disk
2024-03-11 10:29:17 +01:00
Dominik Jain
1714c7f2c7
High-level API: Fix number of test episodes being incorrectly scaled by number of envs ( #1071 )
2024-03-07 08:57:11 -08:00
Maximilian Huettenrauch
95cbfe6cdf
added explicit env seeding for train and test envs
2024-03-06 17:09:06 +01:00
Michael Panchenko
6746a80f6d
Add publish workflow, first preparation for next release ( #1067 )
2024-03-04 12:21:49 +01:00
Michael Panchenko
fdb69f1273
Improve README, minor changes in procedural example ( #1068 )
2024-03-03 15:07:07 +01:00
Dominik Jain
b6b2c95ac7
Improve README, minor changes in procedural example
2024-03-03 15:06:40 +01:00
Erni
1aee41fa9c
Using dist.mode instead of logits.argmax ( #1066 )
...
changed all the occurrences where an action is selected deterministically
- **from**: using the outputs of the actor network.
- **to**: using the mode of the PyTorch distribution.
---------
Co-authored-by: Arnau Jimenez <arnau.jimenez@zeiss.com>
2024-03-03 00:09:39 +01:00
maxhuettenrauch
7c970df53f
Fix/add watch env with obs rms ( #1061 )
...
Supports deciding whether to watch the agent performing on the env using high-level interfaces
2024-02-29 15:59:11 +01:00
Dominik Jain
49781e715e
Fix high-level examples ( #1060 )
...
The high-level examples were all broken by changes made to make mypy
pass.
This PR fixes them, making a type change in logging.run_cli instead to
make mypy happy.
2024-02-23 23:17:14 +01:00
Ashok Arora
0b61bf8caf
Fix the link to the contributing guide. ( #1062 )
2024-02-23 23:15:41 +01:00
Carlo Cagnetta
ce371ae736
remove old python versions from poetry classifier ( #1059 )
2024-02-21 15:27:53 +01:00
Michael Panchenko
9b6cb6903e
Improvements in High-Level API and Poe Tasks ( #1055 )
...
* Add an option to SamplingConfig which allows to configure number of
test episodes
* Make OptimizerFactory more flexible, adding method
`create_optimizer_for_params`
* Fix AutoAlphaFactoryDefault using hard-coded Adam optimizer
* Fix mypy issues that were platform/installation-dependent
* Limit scope of nbqa, resolving issues with files generated by old
versions of the build
Fixes #1054
2024-02-15 12:02:16 +01:00
Dominik Jain
26e210a6ae
Apply nbqa only to the docs/ folder and exclude the (old) jupyter_execute folder
2024-02-15 11:39:45 +01:00
Dominik Jain
08728ad35e
Resolve platform-specific/installation-specific mypy issues
...
by adding ignores and ignoring unused ignores locally
2024-02-15 11:26:54 +01:00
Dominik Jain
f2e0fd165d
Fix gitignore applying to tianshou/env on platfoms with case-insensitive file system
2024-02-15 11:26:39 +01:00
Dominik Jain
eeb2081ca6
Fix AutoAlphaFactoryDefault using hard-coded Adam optimizer instead of passed factory
2024-02-14 20:43:38 +01:00
Dominik Jain
76cbd7efc2
Make OptimizerFactory more flexible by adding a second method which
...
allows the creation of an optimizer given arbitrary parameters
(rather than a module)
2024-02-14 20:42:06 +01:00
Dominik Jain
bf391853dc
Allow to configure number of test episodes in high-level API
2024-02-14 19:14:28 +01:00
Michael Panchenko
8742e3645c
Docs, js - typo in path
2024-02-14 10:50:06 +01:00
Michael Panchenko
5cc51145da
Docs/hotfix ( #1052 )
2024-02-12 18:54:38 +01:00
Michael Panchenko
7a30b842b6
Add vega scripts explictly to config ( #1051 )
2024-02-12 18:49:32 +01:00
Michael Panchenko
d3fe87b70d
Docs: added symlinks for paths resolution, removed jquery loading ( #1050 )
2024-02-12 17:38:25 +01:00
Michael Panchenko
e3c610d37c
Docs: Added jquery, better handling of js files through sphinx config… ( #1049 )
...
Closes #1005 #1045
2024-02-12 15:43:32 +01:00
Michael Panchenko
33d241a29b
Docs/html doc issues ( #1048 )
...
Closes #1005
## Main changes
2. Load vega-embed things using jupyter-book config
3. Add vega-embed dependencies as part of local code for offline
development
4. Reduced duplication in benchmark.js
5. Update sphinx, docutils, and jupyter-book
Co-authored-by: carlocagnetta <c.cagnetta@appliedai.de>
2024-02-09 19:43:10 +01:00
Carlo Cagnetta
5fc314bd4b
Docs/use nbqa on notebooks ( #1041 )
...
- Added nbqa to pyproject.toml
- Resolved mypy issues on notebooks and related files
- Conducting ruff checks on notebooks
- Add DataclassPPrintMixin for better stats representation
- Improved Notebooks wording and explanations
Resolve : #1004
Related to #974
2024-02-07 17:28:16 +01:00
maxhuettenrauch
5fe9aea798
Update and fix dependencies related to mac install ( #1044 )
...
Addresses part of #1015
### Dependencies
- move jsonargparse and docstring-parser to dependencies to run hl
examples without dev
- create mujoco-py extra for legacy mujoco envs
- updated atari extra
- removed atari-py and gym dependencies
- added ALE-py, autorom, and shimmy
- created robotics extra for HER-DDPG
### Mac specific
- only install envpool when not on mac
- mujoco-py not working on macOS newer than Monterey
(https://github.com/openai/mujoco-py/issues/777 )
- D4RL also fails due to dependency on mujoco-py
(https://github.com/Farama-Foundation/D4RL/issues/232 )
### Other
- reduced training-num/test-num in example files to a number ≤ 20
(examples with 100 led to too many open files)
- rendering for Mujoco envs needs to be fixed on gymnasium side
(https://github.com/Farama-Foundation/Gymnasium/issues/749 )
---------
Co-authored-by: Maximilian Huettenrauch <m.huettenrauch@appliedai.de>
Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com>
2024-02-06 17:06:38 +01:00
Daniel Plop
eb0215cf76
Refactoring/mypy issues test ( #1017 )
...
Improves typing in examples and tests, towards mypy passing there.
Introduces the SpaceInfo utility
2024-02-06 14:24:30 +01:00
Michael Panchenko
4756ee80ff
Fixed links and added poetry install info in README [skip ci]
2024-01-24 18:07:02 +01:00
Michael Panchenko
a0e0824c64
Typo in README.md [skip ci]
2024-01-24 17:50:15 +01:00
dependabot[bot]
69a0e6bbd7
Bump notebook from 7.0.6 to 7.0.7 ( #1027 )
...
[//]: # (dependabot-start)
⚠️ **Dependabot is rebasing this PR** ⚠️
Rebasing might not happen immediately, so don't worry if this takes some
time.
Note: if you make any changes to this PR yourself, they will take
precedence over the rebase.
---
[//]: # (dependabot-end)
Bumps [notebook](https://github.com/jupyter/notebook ) from 7.0.6 to
7.0.7.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a
href="https://github.com/jupyter/notebook/releases ">notebook's
releases</a>.</em></p>
<blockquote>
<h2>v7.0.7</h2>
<h2>7.0.7</h2>
<p>(<a
href="https://github.com/jupyter/notebook/compare/@jupyter-notebook/application-extension@7.0.6...089c78c48fd00b2b0d2f33e4463eb42018e86803 ">Full
Changelog</a>)</p>
<h3>Enhancements made</h3>
<ul>
<li>Update to JupyterLab 4.0.11 <a
href="https://redirect.github.com/jupyter/notebook/pull/7215 ">#7215</a>
(<a
href="https://github.com/krassowski "><code>@krassowski</code></a>)</li>
</ul>
<h3>Maintenance and upkeep improvements</h3>
<ul>
<li>Update ruff config and typing <a
href="https://redirect.github.com/jupyter/notebook/pull/7145 ">#7145</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>Clean up lint handling <a
href="https://redirect.github.com/jupyter/notebook/pull/7142 ">#7142</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>Adopt ruff format <a
href="https://redirect.github.com/jupyter/notebook/pull/7132 ">#7132</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>[7.0.x] Install stable JupyterLab 4.0 in the releaser hook <a
href="https://redirect.github.com/jupyter/notebook/pull/7183 ">#7183</a>
(<a href="https://github.com/jtpio "><code>@jtpio</code></a>)</li>
<li>Update publish-release workflow for PyPI trusted publisher <a
href="https://redirect.github.com/jupyter/notebook/pull/7176 ">#7176</a>
(<a href="https://github.com/jtpio "><code>@jtpio</code></a>)</li>
</ul>
<h3>Contributors to this release</h3>
<p>(<a
href="https://github.com/jupyter/notebook/graphs/contributors?from=2023-10-17&to=2024-01-19&type=c ">GitHub
contributors page for this release</a>)</p>
<p><a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Abrichet+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@brichet</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ad5423197+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@d5423197</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Agithub-actions+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@github-actions</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ajtpio+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@jtpio</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Akrassowski+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@krassowski</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ameeseeksmachine+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@meeseeksmachine</code></a></p>
</blockquote>
</details>
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/jupyter/notebook/blob/@jupyter-notebook/tree@7.0.7/CHANGELOG.md ">notebook's
changelog</a>.</em></p>
<blockquote>
<h2>7.0.7</h2>
<p>(<a
href="https://github.com/jupyter/notebook/compare/@jupyter-notebook/application-extension@7.0.6...089c78c48fd00b2b0d2f33e4463eb42018e86803 ">Full
Changelog</a>)</p>
<h3>Enhancements made</h3>
<ul>
<li>Update to JupyterLab 4.0.11 <a
href="https://redirect.github.com/jupyter/notebook/pull/7215 ">#7215</a>
(<a
href="https://github.com/krassowski "><code>@krassowski</code></a>)</li>
</ul>
<h3>Maintenance and upkeep improvements</h3>
<ul>
<li>Update ruff config and typing <a
href="https://redirect.github.com/jupyter/notebook/pull/7145 ">#7145</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>Clean up lint handling <a
href="https://redirect.github.com/jupyter/notebook/pull/7142 ">#7142</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>Adopt ruff format <a
href="https://redirect.github.com/jupyter/notebook/pull/7132 ">#7132</a>
(<a
href="https://github.com/blink1073 "><code>@blink1073</code></a>)</li>
<li>[7.0.x] Install stable JupyterLab 4.0 in the releaser hook <a
href="https://redirect.github.com/jupyter/notebook/pull/7183 ">#7183</a>
(<a href="https://github.com/jtpio "><code>@jtpio</code></a>)</li>
<li>Update publish-release workflow for PyPI trusted publisher <a
href="https://redirect.github.com/jupyter/notebook/pull/7176 ">#7176</a>
(<a href="https://github.com/jtpio "><code>@jtpio</code></a>)</li>
</ul>
<h3>Contributors to this release</h3>
<p>(<a
href="https://github.com/jupyter/notebook/graphs/contributors?from=2023-10-17&to=2024-01-19&type=c ">GitHub
contributors page for this release</a>)</p>
<p><a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Abrichet+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@brichet</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ad5423197+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@d5423197</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Agithub-actions+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@github-actions</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ajtpio+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@jtpio</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Akrassowski+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@krassowski</code></a>
| <a
href="https://github.com/search?q=repo%3Ajupyter%2Fnotebook+involves%3Ameeseeksmachine+updated%3A2023-10-17..2024-01-19&type=Issues "><code>@meeseeksmachine</code></a></p>
<!-- raw HTML omitted -->
</blockquote>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="80e992e9f4
"><code>80e992e</code></a>
Publish 7.0.7</li>
<li><a
href="089c78c48f
"><code>089c78c</code></a>
Update to JupyterLab 4.0.11 (<a
href="https://redirect.github.com/jupyter/notebook/issues/7215 ">#7215</a>)</li>
<li><a
href="109ba75788
"><code>109ba75</code></a>
Backport PR <a
href="https://redirect.github.com/jupyter/notebook/issues/7176 ">#7176</a>:
Update publish-release workflow for PyPI trusted publisher...</li>
<li><a
href="d252423198
"><code>d252423</code></a>
Update ruff config and typing (<a
href="https://redirect.github.com/jupyter/notebook/issues/7145 ">#7145</a>)
(<a
href="https://redirect.github.com/jupyter/notebook/issues/7186 ">#7186</a>)</li>
<li><a
href="d2ef92f0b3
"><code>d2ef92f</code></a>
Backport PR <a
href="https://redirect.github.com/jupyter/notebook/issues/7142 ">#7142</a>:
Clean up lint handling (<a
href="https://redirect.github.com/jupyter/notebook/issues/7185 ">#7185</a>)</li>
<li><a
href="8e9390d9af
"><code>8e9390d</code></a>
Backport PR <a
href="https://redirect.github.com/jupyter/notebook/issues/7132 ">#7132</a>:
Adopt ruff format (<a
href="https://redirect.github.com/jupyter/notebook/issues/7184 ">#7184</a>)</li>
<li><a
href="4d07f1ee9b
"><code>4d07f1e</code></a>
Install stable JupyterLab 4.0 in the releaser hook (<a
href="https://redirect.github.com/jupyter/notebook/issues/7183 ">#7183</a>)</li>
<li><a
href="e73d410074
"><code>e73d410</code></a>
Updated ui-tests Configuration in Contributing.md (<a
href="https://redirect.github.com/jupyter/notebook/issues/7124 ">#7124</a>)</li>
<li><a
href="ea1a1538ef
"><code>ea1a153</code></a>
Set <code>navigation_with_keys</code> to <code>False</code> (<a
href="https://redirect.github.com/jupyter/notebook/issues/7129 ">#7129</a>)</li>
<li><a
href="d717c6b361
"><code>d717c6b</code></a>
Add Python 3.12 classifier (<a
href="https://redirect.github.com/jupyter/notebook/issues/7111 ">#7111</a>)</li>
<li>See full diff in <a
href="https://github.com/jupyter/notebook/compare/@jupyter-notebook/tree@7.0.6...@jupyter-notebook/tree@7.0.7 ">compare
view</a></li>
</ul>
</details>
<br />
[](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores )
Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.
[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)
---
<details>
<summary>Dependabot commands and options</summary>
<br />
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)
You can disable automated security fix PRs for this repo from the
[Security Alerts
page](https://github.com/thu-ml/tianshou/network/alerts ).
</details>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-23 19:48:58 +01:00