René's URL Explorer Experiment


Title: Complementary re patterns such as [\s\S] or [\w\W] are much slower than . with DOTALL · Issue #111259 · python/cpython · GitHub

Description: Bug report Bug description: import re from time import perf_counter as time p1 = re.compile(r"[\s\S]*") p2 = re.compile(".*", re.DOTALL) s = "a"*10000 for p in (p1,p2): t0 = time() for i in range(10000): _=p.match(s) print(time()-t0) Run...

Opengraph URL: https://github.com/python/cpython/issues/111259

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Complementary re patterns such as [\\s\\S] or [\\w\\W] are much slower than . with DOTALL ","articleBody":"# Bug report\n\n### Bug description:\n\n```python\nimport re\nfrom time import perf_counter as time\n\np1 = re.compile(r\"[\\s\\S]*\")\np2 = re.compile(\".*\", re.DOTALL)\n\ns = \"a\"*10000\nfor p in (p1,p2):\n    t0 = time()\n    for i in range(10000): _=p.match(s)\n    print(time()-t0)\n```\nRuntimes are 0.44 s vs 0.0016 s on my system. Instead of simplification, the [\\s\\S] is stepped through one after another. \\s does not match so then \\S is checked (the order [\\S\\s] is twice as fast for the string here). This is not solely an issue for larger matches. A 40 char string is processed half as fast when using [\\s\\S]. Even 10 chars take about 25% longer to process. I'm not completely sure whether this qualifies as a bug or an issue with documentation. Other languages don't have the DOTALL option and always rely on the first option. Plenty of posts on SO and elsewhere will thus advocate using [\\s\\S] as an all-matching regex pattern. Unsuspecting Python programmers such as @barneygale may expect [\\s\\S] to be identical to using a dot with DOTALL as seen below.\n\n@serhiy-storchaka\n\nhttps://github.com/python/cpython/blob/9bb202a1a90ef0edce20c495c9426d9766df11bb/Lib/pathlib.py#L126-L133\n\n### CPython versions tested on:\n\n3.11, 3.13\n\n### Operating systems tested on:\n\nLinux, Windows\n\n\u003c!-- gh-linked-prs --\u003e\n### Linked PRs\n* gh-111303\n* gh-120742\n* gh-120745\n* gh-120813\n* gh-120814\n\u003c!-- /gh-linked-prs --\u003e\n","author":{"url":"https://github.com/pan324","@type":"Person","name":"pan324"},"datePublished":"2023-10-24T11:10:09.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":3},"url":"https://github.com/python/cpython/issues/111259"}
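
The comparison described in the issue body can be reproduced with a small script. The following is a minimal sketch, not the reporter's exact benchmark: it adds the reversed class `[\S\s]` and the scoped inline-flag form `(?s:.)` (the idiom named in the linked documentation PR titles), and the helper names `PATTERNS`, `same_span`, and `benchmark` are hypothetical. Timings depend on the machine and the CPython version, so no expected numbers are given; the linked PR "Optimize complementary character sets in RE" suggests the gap may be smaller on newer versions.

```python
import re
import timeit

# Four spellings of "match any character, including newlines".
# The labels are only used for printing.
PATTERNS = {
    r"[\s\S]*": re.compile(r"[\s\S]*"),
    r"[\S\s]*": re.compile(r"[\S\s]*"),
    r"(?s:.)*": re.compile(r"(?s:.)*"),
    r".* with re.DOTALL": re.compile(r".*", re.DOTALL),
}


def same_span(text):
    """Return True if every pattern matches exactly the same span of text."""
    spans = {pattern.match(text).span() for pattern in PATTERNS.values()}
    return len(spans) == 1


def benchmark(text, repeat=1000):
    """Time `repeat` calls of match() per pattern; return {label: seconds}."""
    return {
        label: timeit.timeit(lambda p=pattern: p.match(text), number=repeat)
        for label, pattern in PATTERNS.items()
    }


if __name__ == "__main__":
    # Lengths taken from the issue text: 10, 40, and 10000 characters.
    for length in (10, 40, 10000):
        sample = "a" * length
        assert same_span(sample)
        print(f"string length {length}")
        for label, seconds in benchmark(sample).items():
            print(f"  {label:<20} {seconds:.6f} s")
```

All four patterns are semantically equivalent, which `same_span` checks before timing; any difference in the printed numbers is therefore purely a matter of how the regex engine executes each form.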

Links:

Complementary re patterns such as [\s\S] or [\w\W] are much slower than . with DOTALL https://github.com/python/cpython/issues/111259#top
performance (Performance or resource usage) https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22performance%22
topic-regex https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22topic-regex%22
pan324 https://github.com/pan324
on Oct 24, 2023 https://github.com/python/cpython/issues/111259#issue-1959017462
@barneygale https://github.com/barneygale
@serhiy-storchaka https://github.com/serhiy-storchaka
cpython/Lib/pathlib.py https://github.com/python/cpython/blob/9bb202a1a90ef0edce20c495c9426d9766df11bb/Lib/pathlib.py#L126-L133
9bb202a https://github.com/python/cpython/commit/9bb202a1a90ef0edce20c495c9426d9766df11bb
gh-111259: Optimize recursive wildcards in pathlib #111303 https://github.com/python/cpython/pull/111303
gh-111259: Optimize complementary character sets in RE #120742 https://github.com/python/cpython/pull/120742
gh-111259: Document idiomatic RE pattern (?s:.) that matches any character #120745 https://github.com/python/cpython/pull/120745
[3.13] gh-111259: Document idiomatic RE pattern (?s:.) that matches any character (GH-120745) #120813 https://github.com/python/cpython/pull/120813
[3.12] gh-111259: Document idiomatic RE pattern (?s:.) that matches any character (GH-120745) #120814 https://github.com/python/cpython/pull/120814