René's URL Explorer Experiment

Title: reward-learning · GitHub Topics · GitHub

Open Graph Title: Build software better, together

X Title: GitHub

Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Open Graph Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

X Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Opengraph URL: https://github.com

X: github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern	/topics/:topic_name(.:format)
route-controller	topics
route-action	show
fetch-nonce	v2:417afb42-a041-8950-f7b9-e2de46a4807e
current-catalog-service-hash	82c569b93da5c18ed649ebd4c2c79437db4611a6a1373e805a3cb001c64130b7
request-id	927E:133EFF:28D3CF5:37FE56B:698C94E2
html-safe-nonce	d0b985992bb468e75933801a6a455b5ec1e8aec7e9a58b3b92180cd8e8d77754
visitor-payload	eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI5MjdFOjEzM0VGRjoyOEQzQ0Y1OjM3RkU1NkI6Njk4Qzk0RTIiLCJ2aXNpdG9yX2lkIjoiNDY5NzI0MjgxNjcwNjU0ODk2MiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac	8406c7bae96993ad12f73b3708179f6218a3654060443dd775262c83f4f1e130
github-keyboard-shortcuts	copilot
google-site-verification	Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url	https://collector.github.com/github/collect
fb:app_id	1401488693436528
apple-itunes-app	app-id=1477376905, app-argument=https://github.com/topics/reward-learning
og:site_name	GitHub
og:image	https://github.githubassets.com/assets/github-octocat-13c86b8b336d.png
og:image:type	image/png
og:image:width	1200
og:image:height	620
twitter:site:id	13334762
twitter:creator	github
twitter:creator:id	13334762
twitter:card	summary_large_image
twitter:image	https://github.githubassets.com/assets/github-logo-55c5b9a1fe52.png
twitter:image:width	1200
twitter:image:height	1200
hostname	github.com
expected-hostname	github.com
None	640eeb7b6ff4d8d106235d228c0c286e82592d4d2403227b5b2b4fc5832297a4
turbo-cache-control	no-preview
turbo-body-classes	logged-out env-production page-responsive
disable-turbo	false
browser-stats-url	https://api.github.com/_private/browser/stats
browser-errors-url	https://api.github.com/_private/browser/errors
release	3d444f0a47beeeac94cddbb51c91ab408befe8d4
ui-target	full
theme-color	#1e2327
color-scheme	light dark

Links:

Skip to content	https://patch-diff.githubusercontent.com/topics/reward-learning#start-of-content
	https://patch-diff.githubusercontent.com/
Sign in	https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Freward-learning
GitHub CopilotWrite better code with AI	https://github.com/features/copilot
GitHub SparkBuild and deploy intelligent apps	https://github.com/features/spark
GitHub ModelsManage and compare prompts	https://github.com/features/models
MCP RegistryNewIntegrate external tools	https://github.com/mcp
ActionsAutomate any workflow	https://github.com/features/actions
CodespacesInstant dev environments	https://github.com/features/codespaces
IssuesPlan and track work	https://github.com/features/issues
Code ReviewManage code changes	https://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilities	https://github.com/security/advanced-security
Code securitySecure your code as you build	https://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they start	https://github.com/security/advanced-security/secret-protection
Why GitHub	https://github.com/why-github
Documentation	https://docs.github.com
Blog	https://github.blog
Changelog	https://github.blog/changelog
Marketplace	https://github.com/marketplace
View all features	https://github.com/features
Enterprises	https://github.com/enterprise
Small and medium teams	https://github.com/team
Startups	https://github.com/enterprise/startups
Nonprofits	https://github.com/solutions/industry/nonprofits
App Modernization	https://github.com/solutions/use-case/app-modernization
DevSecOps	https://github.com/solutions/use-case/devsecops
DevOps	https://github.com/solutions/use-case/devops
CI/CD	https://github.com/solutions/use-case/ci-cd
View all use cases	https://github.com/solutions/use-case
Healthcare	https://github.com/solutions/industry/healthcare
Financial services	https://github.com/solutions/industry/financial-services
Manufacturing	https://github.com/solutions/industry/manufacturing
Government	https://github.com/solutions/industry/government
View all industries	https://github.com/solutions/industry
View all solutions	https://github.com/solutions
AI	https://github.com/resources/articles?topic=ai
Software Development	https://github.com/resources/articles?topic=software-development
DevOps	https://github.com/resources/articles?topic=devops
Security	https://github.com/resources/articles?topic=security
View all topics	https://github.com/resources/articles
Customer stories	https://github.com/customer-stories
Events & webinars	https://github.com/resources/events
Ebooks & reports	https://github.com/resources/whitepapers
Business insights	https://github.com/solutions/executive-insights
GitHub Skills	https://skills.github.com
Documentation	https://docs.github.com
Customer support	https://support.github.com
Community forum	https://github.com/orgs/community/discussions
Trust center	https://github.com/trust-center
Partners	https://github.com/partners
GitHub SponsorsFund open source developers	https://github.com/sponsors
Security Lab	https://securitylab.github.com
Maintainer Community	https://maintainers.github.com
Accelerator	https://github.com/accelerator
Archive Program	https://archiveprogram.github.com
Topics	https://github.com/topics
Trending	https://github.com/trending
Collections	https://github.com/collections
Enterprise platformAI-powered developer platform	https://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security features	https://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI features	https://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 support	https://github.com/premium-support
Pricing	https://github.com/pricing
Search syntax tips	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentation	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in	https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Freward-learning
Sign up	https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2Ftopics%2Freward-learning&source=header
Reload	https://patch-diff.githubusercontent.com/topics/reward-learning
Reload	https://patch-diff.githubusercontent.com/topics/reward-learning
Reload	https://patch-diff.githubusercontent.com/topics/reward-learning
Explore	https://patch-diff.githubusercontent.com/explore
Topics	https://patch-diff.githubusercontent.com/topics
Trending	https://patch-diff.githubusercontent.com/trending
Collections	https://patch-diff.githubusercontent.com/collections
Events	https://patch-diff.githubusercontent.com/events
GitHub Sponsors	https://patch-diff.githubusercontent.com/sponsors/explore
Star	https://patch-diff.githubusercontent.com/login?return_to=%2Ftopic.reward-learning
All 10	https://github.com/topics/reward-learning
Python 6	https://github.com/topics/reward-learning?l=python
Jupyter Notebook 1	https://github.com/topics/reward-learning?l=jupyter+notebook
NetLogo 1	https://github.com/topics/reward-learning?l=netlogo
XSLT 1	https://github.com/topics/reward-learning?l=xslt
HumanCompatibleAI	https://patch-diff.githubusercontent.com/HumanCompatibleAI
imitation	https://patch-diff.githubusercontent.com/HumanCompatibleAI/imitation
Star 1.7k	https://patch-diff.githubusercontent.com/login?return_to=%2FHumanCompatibleAI%2Fimitation
Code	https://patch-diff.githubusercontent.com/HumanCompatibleAI/imitation
Issues	https://patch-diff.githubusercontent.com/HumanCompatibleAI/imitation/issues
Pull requests	https://patch-diff.githubusercontent.com/HumanCompatibleAI/imitation/pulls
imitation-learning	https://patch-diff.githubusercontent.com/topics/imitation-learning
gymnasium	https://patch-diff.githubusercontent.com/topics/gymnasium
inverse-reinforcement-learning	https://patch-diff.githubusercontent.com/topics/inverse-reinforcement-learning
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
snap-stanford	https://patch-diff.githubusercontent.com/snap-stanford
optimas	https://patch-diff.githubusercontent.com/snap-stanford/optimas
Star 73	https://patch-diff.githubusercontent.com/login?return_to=%2Fsnap-stanford%2Foptimas
Code	https://patch-diff.githubusercontent.com/snap-stanford/optimas
Issues	https://patch-diff.githubusercontent.com/snap-stanford/optimas/issues
Pull requests	https://patch-diff.githubusercontent.com/snap-stanford/optimas/pulls
optimization	https://patch-diff.githubusercontent.com/topics/optimization
multiagent-systems	https://patch-diff.githubusercontent.com/topics/multiagent-systems
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
compound-ai-systems	https://patch-diff.githubusercontent.com/topics/compound-ai-systems
bobxwu	https://patch-diff.githubusercontent.com/bobxwu
learning-from-rewards-llm-papers	https://patch-diff.githubusercontent.com/bobxwu/learning-from-rewards-llm-papers
Star 63	https://patch-diff.githubusercontent.com/login?return_to=%2Fbobxwu%2Flearning-from-rewards-llm-papers
Code	https://patch-diff.githubusercontent.com/bobxwu/learning-from-rewards-llm-papers
Issues	https://patch-diff.githubusercontent.com/bobxwu/learning-from-rewards-llm-papers/issues
Pull requests	https://patch-diff.githubusercontent.com/bobxwu/learning-from-rewards-llm-papers/pulls
reinforcement-learning	https://patch-diff.githubusercontent.com/topics/reinforcement-learning
post-training	https://patch-diff.githubusercontent.com/topics/post-training
self-correction	https://patch-diff.githubusercontent.com/topics/self-correction
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
large-language-models	https://patch-diff.githubusercontent.com/topics/large-language-models
llm	https://patch-diff.githubusercontent.com/topics/llm
llms	https://patch-diff.githubusercontent.com/topics/llms
reward-models	https://patch-diff.githubusercontent.com/topics/reward-models
reward-model	https://patch-diff.githubusercontent.com/topics/reward-model
reward-modeling	https://patch-diff.githubusercontent.com/topics/reward-modeling
guided-decoding	https://patch-diff.githubusercontent.com/topics/guided-decoding
test-time-scaling	https://patch-diff.githubusercontent.com/topics/test-time-scaling
csmile-1006	https://patch-diff.githubusercontent.com/csmile-1006
REDS_agent	https://patch-diff.githubusercontent.com/csmile-1006/REDS_agent
Star 18	https://patch-diff.githubusercontent.com/login?return_to=%2Fcsmile-1006%2FREDS_agent
Code	https://patch-diff.githubusercontent.com/csmile-1006/REDS_agent
Issues	https://patch-diff.githubusercontent.com/csmile-1006/REDS_agent/issues
Pull requests	https://patch-diff.githubusercontent.com/csmile-1006/REDS_agent/pulls
reinforcement-learning	https://patch-diff.githubusercontent.com/topics/reinforcement-learning
visual-reinforcement-learning	https://patch-diff.githubusercontent.com/topics/visual-reinforcement-learning
reward-shaping	https://patch-diff.githubusercontent.com/topics/reward-shaping
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
reward-models	https://patch-diff.githubusercontent.com/topics/reward-models
HumanCompatibleAI	https://patch-diff.githubusercontent.com/HumanCompatibleAI
interpreting-rewards	https://patch-diff.githubusercontent.com/HumanCompatibleAI/interpreting-rewards
Star 10	https://patch-diff.githubusercontent.com/login?return_to=%2FHumanCompatibleAI%2Finterpreting-rewards
Code	https://patch-diff.githubusercontent.com/HumanCompatibleAI/interpreting-rewards
Issues	https://patch-diff.githubusercontent.com/HumanCompatibleAI/interpreting-rewards/issues
Pull requests	https://patch-diff.githubusercontent.com/HumanCompatibleAI/interpreting-rewards/pulls
deep-reinforcement-learning	https://patch-diff.githubusercontent.com/topics/deep-reinforcement-learning
interpretability	https://patch-diff.githubusercontent.com/topics/interpretability
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
Masoudjafaripour	https://patch-diff.githubusercontent.com/Masoudjafaripour
OnlineRLHF	https://patch-diff.githubusercontent.com/Masoudjafaripour/OnlineRLHF
Star 6	https://patch-diff.githubusercontent.com/login?return_to=%2FMasoudjafaripour%2FOnlineRLHF
Code	https://patch-diff.githubusercontent.com/Masoudjafaripour/OnlineRLHF
Issues	https://patch-diff.githubusercontent.com/Masoudjafaripour/OnlineRLHF/issues
Pull requests	https://patch-diff.githubusercontent.com/Masoudjafaripour/OnlineRLHF/pulls
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
pbrl	https://patch-diff.githubusercontent.com/topics/pbrl
rlhf	https://patch-diff.githubusercontent.com/topics/rlhf
preference-based-reinforcement-learning	https://patch-diff.githubusercontent.com/topics/preference-based-reinforcement-learning
Entience	https://patch-diff.githubusercontent.com/Entience
ASIMOV	https://patch-diff.githubusercontent.com/Entience/ASIMOV
Star 5	https://patch-diff.githubusercontent.com/login?return_to=%2FEntience%2FASIMOV
Code	https://patch-diff.githubusercontent.com/Entience/ASIMOV
Issues	https://patch-diff.githubusercontent.com/Entience/ASIMOV/issues
Pull requests	https://patch-diff.githubusercontent.com/Entience/ASIMOV/pulls
addiction	https://patch-diff.githubusercontent.com/topics/addiction
foraging	https://patch-diff.githubusercontent.com/topics/foraging
agent-based-simulation	https://patch-diff.githubusercontent.com/topics/agent-based-simulation
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
homeostatic-plasticity	https://patch-diff.githubusercontent.com/topics/homeostatic-plasticity
ethanvillalovoz	https://patch-diff.githubusercontent.com/ethanvillalovoz
clarification-guided-reward-learning	https://patch-diff.githubusercontent.com/ethanvillalovoz/clarification-guided-reward-learning
Star 0	https://patch-diff.githubusercontent.com/login?return_to=%2Fethanvillalovoz%2Fclarification-guided-reward-learning
Code	https://patch-diff.githubusercontent.com/ethanvillalovoz/clarification-guided-reward-learning
Issues	https://patch-diff.githubusercontent.com/ethanvillalovoz/clarification-guided-reward-learning/issues
Pull requests	https://patch-diff.githubusercontent.com/ethanvillalovoz/clarification-guided-reward-learning/pulls
robotics	https://patch-diff.githubusercontent.com/topics/robotics
bayesian-inference	https://patch-diff.githubusercontent.com/topics/bayesian-inference
human-robot-interaction	https://patch-diff.githubusercontent.com/topics/human-robot-interaction
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
clarification-questions	https://patch-diff.githubusercontent.com/topics/clarification-questions
caitlin-leonard	https://patch-diff.githubusercontent.com/caitlin-leonard
pacman-rl-agent	https://patch-diff.githubusercontent.com/caitlin-leonard/pacman-rl-agent
Star 0	https://patch-diff.githubusercontent.com/login?return_to=%2Fcaitlin-leonard%2Fpacman-rl-agent
Code	https://patch-diff.githubusercontent.com/caitlin-leonard/pacman-rl-agent
Issues	https://patch-diff.githubusercontent.com/caitlin-leonard/pacman-rl-agent/issues
Pull requests	https://patch-diff.githubusercontent.com/caitlin-leonard/pacman-rl-agent/pulls
python	https://patch-diff.githubusercontent.com/topics/python
machine-learning	https://patch-diff.githubusercontent.com/topics/machine-learning
reinforcement-learning	https://patch-diff.githubusercontent.com/topics/reinforcement-learning
pacman	https://patch-diff.githubusercontent.com/topics/pacman
tkinter-gui	https://patch-diff.githubusercontent.com/topics/tkinter-gui
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
rl-agent	https://patch-diff.githubusercontent.com/topics/rl-agent
NBCLab	https://patch-diff.githubusercontent.com/NBCLab
probabilistic-selection-task	https://patch-diff.githubusercontent.com/NBCLab/probabilistic-selection-task
Star 0	https://patch-diff.githubusercontent.com/login?return_to=%2FNBCLab%2Fprobabilistic-selection-task
Code	https://patch-diff.githubusercontent.com/NBCLab/probabilistic-selection-task
Issues	https://patch-diff.githubusercontent.com/NBCLab/probabilistic-selection-task/issues
Pull requests	https://patch-diff.githubusercontent.com/NBCLab/probabilistic-selection-task/pulls
neuroimaging	https://patch-diff.githubusercontent.com/topics/neuroimaging
fmri	https://patch-diff.githubusercontent.com/topics/fmri
eprime	https://patch-diff.githubusercontent.com/topics/eprime
reward-learning	https://patch-diff.githubusercontent.com/topics/reward-learning
behavioral-task	https://patch-diff.githubusercontent.com/topics/behavioral-task
Curate this topic	https://github.com/github/explore/tree/master/CONTRIBUTING.md?source=add-description-reward-learning
Learn more	https://docs.github.com/en/articles/classifying-your-repository-with-topics
	https://github.com
Terms	https://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacy	https://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Security	https://github.com/security
Status	https://www.githubstatus.com/
Community	https://github.community/
Docs	https://docs.github.com/
Contact	https://support.github.com?tags=dotcom-footer

Viewport: width=device-width

URLs of crawlers that visited me.