René's URL Explorer Experiment


Title: GitHub - RL-code-lib/tianshou: An elegant PyTorch deep reinforcement learning platform.

Description: An elegant PyTorch deep reinforcement learning platform. - RL-code-lib/tianshou

Opengraph URL: https://github.com/RL-code-lib/tianshou

X: @github

Domain: patch-diff.githubusercontent.com

go-import: github.com/RL-code-lib/tianshou git https://github.com/RL-code-lib/tianshou.git
octolytics-dimension-user_id: 82717011
octolytics-dimension-user_login: RL-code-lib
octolytics-dimension-repository_id: 358905797
octolytics-dimension-repository_nwo: RL-code-lib/tianshou
octolytics-dimension-repository_public: true
octolytics-dimension-repository_is_fork: true
octolytics-dimension-repository_parent_id: 129815042
octolytics-dimension-repository_parent_nwo: thu-ml/tianshou
octolytics-dimension-repository_network_root_id: 129815042
octolytics-dimension-repository_network_root_nwo: thu-ml/tianshou

Links:

RL-code-lib https://patch-diff.githubusercontent.com/RL-code-lib
tianshou https://patch-diff.githubusercontent.com/RL-code-lib/tianshou
thu-ml/tianshou https://patch-diff.githubusercontent.com/thu-ml/tianshou
tianshou.readthedocs.io https://tianshou.readthedocs.io
MIT license https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/LICENSE
0 stars https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/stargazers
1.3k forks https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/forks
Branches https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/branches
Tags https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tags
Activity https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/activity
Code https://patch-diff.githubusercontent.com/RL-code-lib/tianshou
Pull requests 0 https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/pulls
Actions https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/actions
Projects 0 https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/projects
Security 0 https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/security
Insights https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/pulse
250 Commits https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/commits/master/
.github https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tree/master/.github
docs https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tree/master/docs
examples https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tree/master/examples
test https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tree/master/test
tianshou https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tree/master/tianshou
.gitignore https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/.gitignore
CONTRIBUTING.md https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/CONTRIBUTING.md
LICENSE https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/LICENSE
MANIFEST.in https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/MANIFEST.in
README.md https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/README.md
setup.cfg https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/setup.cfg
setup.py https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/setup.py
http://tianshou.readthedocs.io
https://pypi.org/project/tianshou/
https://github.com/conda-forge/tianshou-feedstock
https://tianshou.readthedocs.io/en/latest
https://tianshou.readthedocs.io/zh/latest/
https://github.com/thu-ml/tianshou/actions
https://codecov.io/gh/thu-ml/tianshou
https://github.com/thu-ml/tianshou/issues
https://github.com/thu-ml/tianshou/stargazers
https://github.com/thu-ml/tianshou/network
https://github.com/thu-ml/tianshou/blob/master/LICENSE
https://gitter.im/thu-ml/tianshou?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge
天授 https://baike.baidu.com/item/%E5%A4%A9%E6%8E%88
Deep Q-Network (DQN) https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf
Double DQN https://arxiv.org/pdf/1509.06461.pdf
Dueling DQN https://arxiv.org/pdf/1511.06581.pdf
Categorical DQN (C51) https://arxiv.org/pdf/1707.06887.pdf
Quantile Regression DQN (QRDQN) https://arxiv.org/pdf/1710.10044.pdf
Policy Gradient (PG) https://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf
Advantage Actor-Critic (A2C) https://openai.com/blog/baselines-acktr-a2c/
Trust Region Policy Optimization https://arxiv.org/pdf/1502.05477.pdf
Proximal Policy Optimization (PPO) https://arxiv.org/pdf/1707.06347.pdf
Deep Deterministic Policy Gradient (DDPG) https://arxiv.org/pdf/1509.02971.pdf
Twin Delayed DDPG (TD3) https://arxiv.org/pdf/1802.09477.pdf
Soft Actor-Critic (SAC) https://arxiv.org/pdf/1812.05905.pdf
Discrete Soft Actor-Critic (SAC-Discrete) https://arxiv.org/pdf/1910.07207.pdf
Discrete Batch-Constrained deep Q-Learning (BCQ-Discrete) https://arxiv.org/pdf/1910.01708.pdf
Prioritized Experience Replay (PER) https://arxiv.org/pdf/1511.05952.pdf
Generalized Advantage Estimator (GAE) https://arxiv.org/pdf/1506.02438.pdf
Posterior Sampling Reinforcement Learning (PSRL) https://www.ece.uvic.ca/~bctill/papers/learning/Strens_2000.pdf
MuJoCo benchmark https://github.com/thu-ml/tianshou/tree/master/examples/mujoco
Usage https://tianshou.readthedocs.io/en/latest/tutorials/cheatsheet.html#parallel-sampling
Usage https://tianshou.readthedocs.io/en/latest/tutorials/cheatsheet.html#rnn-style-training
Usage https://tianshou.readthedocs.io/en/latest/tutorials/cheatsheet.html#user-defined-environment-and-different-state-representation
Usage https://tianshou.readthedocs.io/en/latest/tutorials/cheatsheet.html#customize-training-process
Usage https://tianshou.readthedocs.io/en/latest/tutorials/cheatsheet.html##multi-agent-reinforcement-learning
unit tests https://github.com/thu-ml/tianshou/actions
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#installation
PyPI https://pypi.org/project/tianshou/
conda-forge https://github.com/conda-forge/tianshou-feedstock
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#documentation
tianshou.readthedocs.io https://tianshou.readthedocs.io/
test/ https://github.com/thu-ml/tianshou/blob/master/test
examples/ https://github.com/thu-ml/tianshou/blob/master/examples
https://tianshou.readthedocs.io/zh/latest/
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#why-tianshou
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#fast-speed
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/docs/_static/images/testpg.gif
Tianshou https://github.com/thu-ml/tianshou
Baselines https://github.com/openai/baselines
Stable-Baselines https://github.com/hill-a/stable-baselines
Ray/RLlib https://github.com/ray-project/ray/tree/master/rllib/
PyTorch-DRL https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
rlpyt https://github.com/astooke/rlpyt
https://github.com/thu-ml/tianshou/stargazers
https://github.com/openai/baselines/stargazers
https://github.com/hill-a/stable-baselines/stargazers
https://github.com/ray-project/ray/stargazers
https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/stargazers
https://github.com/astooke/rlpyt/stargazers
here https://github.com/astooke/rlpyt/issues/135
examples/atari/ https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/examples/atari
examples/mujoco/ https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/blob/master/examples/mujoco
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#reproducible
GitHub Actions https://github.com/thu-ml/tianshou/actions
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#modularized-policy
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#elegant-and-flexible
documentation https://tianshou.readthedocs.io
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#quick-start
test/discrete/test_dqn.py https://github.com/thu-ml/tianshou/blob/master/test/discrete/test_dqn.py
documentation https://tianshou.readthedocs.io
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#contributing
this link https://tianshou.readthedocs.io/en/latest/contributing.html
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#todo
Project https://github.com/thu-ml/tianshou/projects
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#citing-tianshou
https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#acknowledgment
priv https://github.com/thu-ml/tianshou/tree/priv
Haosheng Zou https://github.com/HaoshengZou
TSAIL http://ml.cs.tsinghua.edu.cn/
Institute for Artificial Intelligence, Tsinghua University http://ml.cs.tsinghua.edu.cn/thuai/
tianshou.readthedocs.io https://tianshou.readthedocs.io
Readme https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#MIT-1-ov-file
Contributing https://patch-diff.githubusercontent.com/RL-code-lib/tianshou#contributing-ov-file
Custom properties https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/custom-properties
0 watching https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/watchers
0 forks https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/forks
Releases https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/releases
15 tags https://patch-diff.githubusercontent.com/RL-code-lib/tianshou/tags
Packages 0 https://patch-diff.githubusercontent.com/orgs/RL-code-lib/packages?repo_name=tianshou
