René's URL Explorer Experiment


Title: GitHub - dreadnode/AIRTBench-Code: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models

Open Graph Title: GitHub - dreadnode/AIRTBench-Code: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models

X Title: GitHub - dreadnode/AIRTBench-Code: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models

Description: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models - dreadnode/AIRTBench-Code

Open Graph Description: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models - dreadnode/AIRTBench-Code

X Description: Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models - dreadnode/AIRTBench-Code

Opengraph URL: https://github.com/dreadnode/AIRTBench-Code

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:7a792440-6660-76e2-f439-743f167e9ef9
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idC24C:3D089B:21DBCC4:2F0DF5D:696FF30A
html-safe-noncea077b8a5791d73ccc0140e83d91b938ce7820f5c95eea1151fb7083ac5051675
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDMjRDOjNEMDg5QjoyMURCQ0M0OjJGMERGNUQ6Njk2RkYzMEEiLCJ2aXNpdG9yX2lkIjoiNzA3ODMzNjA2NjcxNTkwNjgyNiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmacff7f0f4b0a483d03fd4b5a05c2a10cbbc031abd9740a637394e44ba772cd5097
hovercard-subject-tagrepository:996088065
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/dreadnode/AIRTBench-Code
twitter:imagehttps://opengraph.githubassets.com/f38ec1cb75cb73d4a1c142fdd36c1efb5bb911d20c3d1d8159bac5b92e4606c5/dreadnode/AIRTBench-Code
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/f38ec1cb75cb73d4a1c142fdd36c1efb5bb911d20c3d1d8159bac5b92e4606c5/dreadnode/AIRTBench-Code
og:image:altCode Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models - dreadnode/AIRTBench-Code
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None2b218dbdee134592a2dbfabd454a1070986f1fbedb8334bf06b8f2ccc3449130
turbo-cache-controlno-preview
go-importgithub.com/dreadnode/AIRTBench-Code git https://github.com/dreadnode/AIRTBench-Code.git
octolytics-dimension-user_id350839
octolytics-dimension-user_logindreadnode
octolytics-dimension-repository_id996088065
octolytics-dimension-repository_nwodreadnode/AIRTBench-Code
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id996088065
octolytics-dimension-repository_network_root_nwodreadnode/AIRTBench-Code
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
releasebcaac379a58a3ed99a1b0502e2a8f5cfd3a7b54b
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/dreadnode/AIRTBench-Code#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdreadnode%2FAIRTBench-Code
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdreadnode%2FAIRTBench-Code
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=dreadnode%2FAIRTBench-Code
Reloadhttps://github.com/dreadnode/AIRTBench-Code
Reloadhttps://github.com/dreadnode/AIRTBench-Code
Reloadhttps://github.com/dreadnode/AIRTBench-Code
dreadnode https://github.com/dreadnode
AIRTBench-Codehttps://github.com/dreadnode/AIRTBench-Code
Notifications https://github.com/login?return_to=%2Fdreadnode%2FAIRTBench-Code
Fork 13 https://github.com/login?return_to=%2Fdreadnode%2FAIRTBench-Code
Star 92 https://github.com/login?return_to=%2Fdreadnode%2FAIRTBench-Code
arxiv.org/abs/2506.14682https://arxiv.org/abs/2506.14682
Apache-2.0 license https://github.com/dreadnode/AIRTBench-Code/blob/main/LICENSE
92 stars https://github.com/dreadnode/AIRTBench-Code/stargazers
13 forks https://github.com/dreadnode/AIRTBench-Code/forks
Branches https://github.com/dreadnode/AIRTBench-Code/branches
Tags https://github.com/dreadnode/AIRTBench-Code/tags
Activity https://github.com/dreadnode/AIRTBench-Code/activity
Star https://github.com/login?return_to=%2Fdreadnode%2FAIRTBench-Code
Notifications https://github.com/login?return_to=%2Fdreadnode%2FAIRTBench-Code
Code https://github.com/dreadnode/AIRTBench-Code
Issues 1 https://github.com/dreadnode/AIRTBench-Code/issues
Pull requests 0 https://github.com/dreadnode/AIRTBench-Code/pulls
Actions https://github.com/dreadnode/AIRTBench-Code/actions
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/dreadnode/AIRTBench-Code/security
Please reload this pagehttps://github.com/dreadnode/AIRTBench-Code
Insights https://github.com/dreadnode/AIRTBench-Code/pulse
Code https://github.com/dreadnode/AIRTBench-Code
Issues https://github.com/dreadnode/AIRTBench-Code/issues
Pull requests https://github.com/dreadnode/AIRTBench-Code/pulls
Actions https://github.com/dreadnode/AIRTBench-Code/actions
Security https://github.com/dreadnode/AIRTBench-Code/security
Insights https://github.com/dreadnode/AIRTBench-Code/pulse
Brancheshttps://github.com/dreadnode/AIRTBench-Code/branches
Tagshttps://github.com/dreadnode/AIRTBench-Code/tags
https://github.com/dreadnode/AIRTBench-Code/branches
https://github.com/dreadnode/AIRTBench-Code/tags
128 Commitshttps://github.com/dreadnode/AIRTBench-Code/commits/main/
https://github.com/dreadnode/AIRTBench-Code/commits/main/
.githubhttps://github.com/dreadnode/AIRTBench-Code/tree/main/.github
.githubhttps://github.com/dreadnode/AIRTBench-Code/tree/main/.github
.hookshttps://github.com/dreadnode/AIRTBench-Code/tree/main/.hooks
.hookshttps://github.com/dreadnode/AIRTBench-Code/tree/main/.hooks
.vscodehttps://github.com/dreadnode/AIRTBench-Code/tree/main/.vscode
.vscodehttps://github.com/dreadnode/AIRTBench-Code/tree/main/.vscode
airtbenchhttps://github.com/dreadnode/AIRTBench-Code/tree/main/airtbench
airtbenchhttps://github.com/dreadnode/AIRTBench-Code/tree/main/airtbench
assetshttps://github.com/dreadnode/AIRTBench-Code/tree/main/assets
assetshttps://github.com/dreadnode/AIRTBench-Code/tree/main/assets
datasethttps://github.com/dreadnode/AIRTBench-Code/tree/main/dataset
datasethttps://github.com/dreadnode/AIRTBench-Code/tree/main/dataset
docshttps://github.com/dreadnode/AIRTBench-Code/tree/main/docs
docshttps://github.com/dreadnode/AIRTBench-Code/tree/main/docs
notebookshttps://github.com/dreadnode/AIRTBench-Code/tree/main/notebooks
notebookshttps://github.com/dreadnode/AIRTBench-Code/tree/main/notebooks
runshttps://github.com/dreadnode/AIRTBench-Code/tree/main/runs
runshttps://github.com/dreadnode/AIRTBench-Code/tree/main/runs
.editorconfighttps://github.com/dreadnode/AIRTBench-Code/blob/main/.editorconfig
.editorconfighttps://github.com/dreadnode/AIRTBench-Code/blob/main/.editorconfig
.env.examplehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.env.example
.env.examplehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.env.example
.gitattributeshttps://github.com/dreadnode/AIRTBench-Code/blob/main/.gitattributes
.gitattributeshttps://github.com/dreadnode/AIRTBench-Code/blob/main/.gitattributes
.gitignorehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.gitignore
.gitignorehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.gitignore
.pre-commit-config.yamlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/.pre-commit-config.yaml
.secrets.baselinehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.secrets.baseline
.secrets.baselinehttps://github.com/dreadnode/AIRTBench-Code/blob/main/.secrets.baseline
CODEOWNERShttps://github.com/dreadnode/AIRTBench-Code/blob/main/CODEOWNERS
CODEOWNERShttps://github.com/dreadnode/AIRTBench-Code/blob/main/CODEOWNERS
LICENSEhttps://github.com/dreadnode/AIRTBench-Code/blob/main/LICENSE
LICENSEhttps://github.com/dreadnode/AIRTBench-Code/blob/main/LICENSE
README.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/README.md
README.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/README.md
RENOVATE_TESTING.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/RENOVATE_TESTING.md
RENOVATE_TESTING.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/RENOVATE_TESTING.md
SECURITY.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/SECURITY.md
SECURITY.mdhttps://github.com/dreadnode/AIRTBench-Code/blob/main/SECURITY.md
Taskfile.yamlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/Taskfile.yaml
Taskfile.yamlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/Taskfile.yaml
pyproject.tomlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/pyproject.toml
pyproject.tomlhttps://github.com/dreadnode/AIRTBench-Code/blob/main/pyproject.toml
python.code-workspacehttps://github.com/dreadnode/AIRTBench-Code/blob/main/python.code-workspace
python.code-workspacehttps://github.com/dreadnode/AIRTBench-Code/blob/main/python.code-workspace
uv.lockhttps://github.com/dreadnode/AIRTBench-Code/blob/main/uv.lock
uv.lockhttps://github.com/dreadnode/AIRTBench-Code/blob/main/uv.lock
READMEhttps://github.com/dreadnode/AIRTBench-Code
Code of conducthttps://github.com/dreadnode/AIRTBench-Code
Contributinghttps://github.com/dreadnode/AIRTBench-Code
Apache-2.0 licensehttps://github.com/dreadnode/AIRTBench-Code
Securityhttps://github.com/dreadnode/AIRTBench-Code
https://github.com/dreadnode/AIRTBench-Code#airtbench-autonomous-ai-red-teaming-agent-code
https://camo.githubusercontent.com/ef090950f64502c0868208bd7453776fe43f5ce3f683f67ad4283b9bec3a9006/68747470733a2f2f64316c7070626c743974327831352e636c6f756466726f6e742e6e65742f6c6f676f732f35373134393238663363646330393530333735313538306366666265386430322e706e67
https://github.com/dreadnode/AIRTBench-Code/actions/workflows/pre-commit.yaml
https://github.com/dreadnode/AIRTBench-Code/actions/workflows/renovate.yaml
https://opensource.org/licenses/Apache-2.0
https://github.com/dreadnode/AIRTBench-Code/releases
https://arxiv.org/abs/2506.14682
https://huggingface.co/datasets/dreadnode/AIRTBench/blob/main/README.md
https://dreadnode.io/blog/ai-red-team-benchmark
https://docs.dreadnode.io/strikes/how-to/airtbench-agent
https://github.com/dreadnode/AIRTBench-Code/stargazers
https://github.com/dreadnode/AIRTBench-Code/pulls
AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Modelshttps://arxiv.org/abs/2506.14682
Do LLM Agents Have AI Red Team Capabilities? We Built a Benchmark to Find Outhttps://dreadnode.io/blog/ai-red-team-benchmark
AIRTBench: Autonomous AI Red Teaming Agent Codehttps://github.com/dreadnode/AIRTBench-Code#airtbench-autonomous-ai-red-teaming-agent-code
Agent Harness Constructionhttps://github.com/dreadnode/AIRTBench-Code#agent-harness-construction
Setuphttps://github.com/dreadnode/AIRTBench-Code#setup
Documentationhttps://github.com/dreadnode/AIRTBench-Code#documentation
Run the Evaluationhttps://github.com/dreadnode/AIRTBench-Code#run-the-evaluation
Basic Usagehttps://github.com/dreadnode/AIRTBench-Code#basic-usage
Challenge Filteringhttps://github.com/dreadnode/AIRTBench-Code#challenge-filtering
Resourceshttps://github.com/dreadnode/AIRTBench-Code#resources
Datasethttps://github.com/dreadnode/AIRTBench-Code#dataset
Citationhttps://github.com/dreadnode/AIRTBench-Code#citation
Model requestshttps://github.com/dreadnode/AIRTBench-Code#model-requests
🤝 Contributinghttps://github.com/dreadnode/AIRTBench-Code#-contributing
🔐 Securityhttps://github.com/dreadnode/AIRTBench-Code#-security
⭐ Star Historyhttps://github.com/dreadnode/AIRTBench-Code#-star-history
https://github.com/dreadnode/AIRTBench-Code#agent-harness-construction
https://github.com/dreadnode/AIRTBench-Code/blob/main/assets/airtbench_architecture_diagram_dark.png
https://github.com/dreadnode/AIRTBench-Code#setup
https://github.com/dreadnode/AIRTBench-Code#documentation
Dreadnode Strikes documentationhttps://docs.dreadnode.io/strikes/how-to/airtbench-agent
https://github.com/dreadnode/AIRTBench-Code#run-the-evaluation
docshttps://docs.Dreadnode.io/strikes/overview
herehttps://platform.dreadnode.io/waitlist/strikes
rigginghttps://docs.dreadnode.io/open-source/rigging/intro
Cruciblehttps://platform.dreadnode.io/crucible
Dockerfilehttps://github.com/dreadnode/AIRTBench-Code/blob/main/airtbench/container/Dockerfile
https://github.com/dreadnode/AIRTBench-Code#basic-usage
https://github.com/dreadnode/AIRTBench-Code#challenge-filtering
the challenge manifesthttps://github.com/dreadnode/AIRTBench-Code/blob/main/airtbench/challenges/.challenges.yaml
https://github.com/dreadnode/AIRTBench-Code#resources
📄 Paper on arXivhttps://arxiv.org/abs/2506.14682
📝 Blog posthttps://dreadnode.io/blog/ai-red-team-benchmark
https://github.com/dreadnode/AIRTBench-Code#dataset
🤗Hugging Facehttps://huggingface.co/datasets/dreadnode/AIRTBench/blob/main/README.md
datasethttps://github.com/dreadnode/AIRTBench-Code/blob/main/dataset/README.md
https://github.com/dreadnode/AIRTBench-Code#citation
https://github.com/dreadnode/AIRTBench-Code#model-requests
https://github.com/dreadnode/AIRTBench-Code#-contributing
Contributing Guidehttps://github.com/dreadnode/AIRTBench-Code/blob/main/docs/contributing.md
https://github.com/dreadnode/AIRTBench-Code#-security
Security Policyhttps://github.com/dreadnode/AIRTBench-Code/blob/main/SECURITY.md
https://github.com/dreadnode/AIRTBench-Code#-star-history
https://github.com/dreadnode/AIRTBench-Code/stargazers
https://star-history.com/#dreadnode/AIRTBench-Code&Date
arxiv.org/abs/2506.14682https://arxiv.org/abs/2506.14682
security https://github.com/topics/security
benchmarking https://github.com/topics/benchmarking
benchmark https://github.com/topics/benchmark
research https://github.com/topics/research
ai https://github.com/topics/ai
evaluations https://github.com/topics/evaluations
hacking https://github.com/topics/hacking
artificial-intelligence https://github.com/topics/artificial-intelligence
cybersecurity https://github.com/topics/cybersecurity
ctf https://github.com/topics/ctf
agents https://github.com/topics/agents
offensive-security https://github.com/topics/offensive-security
ai-agents https://github.com/topics/ai-agents
benchmark-datasets https://github.com/topics/benchmark-datasets
llm https://github.com/topics/llm
cyber-evals https://github.com/topics/cyber-evals
Readme https://github.com/dreadnode/AIRTBench-Code#readme-ov-file
Apache-2.0 license https://github.com/dreadnode/AIRTBench-Code#Apache-2.0-1-ov-file
Code of conduct https://github.com/dreadnode/AIRTBench-Code#coc-ov-file
Contributing https://github.com/dreadnode/AIRTBench-Code#contributing-ov-file
Security policy https://github.com/dreadnode/AIRTBench-Code#security-ov-file
Please reload this pagehttps://github.com/dreadnode/AIRTBench-Code
Activityhttps://github.com/dreadnode/AIRTBench-Code/activity
Custom propertieshttps://github.com/dreadnode/AIRTBench-Code/custom-properties
92 starshttps://github.com/dreadnode/AIRTBench-Code/stargazers
1 watchinghttps://github.com/dreadnode/AIRTBench-Code/watchers
13 forkshttps://github.com/dreadnode/AIRTBench-Code/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fdreadnode%2FAIRTBench-Code&report=dreadnode+%28user%29
Releases 2https://github.com/dreadnode/AIRTBench-Code/releases
v1.0.1 Latest Jul 2, 2025 https://github.com/dreadnode/AIRTBench-Code/releases/tag/v1.0.1
+ 1 releasehttps://github.com/dreadnode/AIRTBench-Code/releases
Please reload this pagehttps://github.com/dreadnode/AIRTBench-Code
Contributors 3https://github.com/dreadnode/AIRTBench-Code/graphs/contributors
Please reload this pagehttps://github.com/dreadnode/AIRTBench-Code
Jupyter Notebook 87.9% https://github.com/dreadnode/AIRTBench-Code/search?l=jupyter-notebook
Python 11.3% https://github.com/dreadnode/AIRTBench-Code/search?l=python
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.