René's URL Explorer Experiment


Title: `base64` module: Link against SIMD library for 10x performance. · Issue #124951 · python/cpython · GitHub

Open Graph Title: `base64` module: Link against SIMD library for 10x performance. · Issue #124951 · python/cpython

X Title: `base64` module: Link against SIMD library for 10x performance. · Issue #124951 · python/cpython

Description: Performance enhancement Proposal: https://pypi.org/project/pybase64/ aka https://github.com/mayeut/pybase64 (BSD licensed) exists. On top of some of its own SIMD code for base64 module extra features (character translation)^, it links ag...

Open Graph Description: Performance enhancement Proposal: https://pypi.org/project/pybase64/ aka https://github.com/mayeut/pybase64 (BSD licensed) exists. On top of some of its own SIMD code for base64 module extra featur...

X Description: Performance enhancement Proposal: https://pypi.org/project/pybase64/ aka https://github.com/mayeut/pybase64 (BSD licensed) exists. On top of some of its own SIMD code for base64 module extra featur...

Opengraph URL: https://github.com/python/cpython/issues/124951

X: @github

direct link

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"`base64` module: Link against SIMD library for 10x performance.","articleBody":"# Performance enhancement\r\n\r\n### Proposal:\r\n\r\nhttps://pypi.org/project/pybase64/ aka https://github.com/mayeut/pybase64 (BSD licensed) exists. On top of some of its own SIMD code for base64 module extra features (character translation)^, it links against https://github.com/aklomp/base64, a BSD licensed C99 library with SIMD acceleration giving 5-20x performance on base64 encoding and decoding operations vs our existing generic byte based base64 C code.\r\n\r\nWe could adopt a bunch of the pybase64 code to make the default base64 module experience better - it is relatively straight forward extension module code (as one would expect). On the other hand, I expect pybase64 to still be where new development and further improvements in this space continue to happen as people who care strongly about performance need the latest and greatest from PyPI regardless of their current CPython version. (looping in @mayeut for thoughts on that)\r\n\r\n**Practicalities**: Library availability? we'd vendor a libbase64 build for use on our binary distributions. I don't think it is currently widely available (? I only did a quick search on Ubuntu) as a package on Linux distributions though so we'd currently need to vendor our own copy in tree to be fair and match the good performance there (yuck, but ideally only temporary until distros pick it up as a package of its own, consider it similar to a Modules/_decimal/libmpdec/ situation - our configure.ac finds an installed one \u0026 distros link against that)\r\n\r\n**Risks**: It is a new C library dependency. Security concerns within it thus become our own. As `base64` is frequently used to process untrusted input. But its surface of possible problems is limited (very simple data format). We should ensure the library gets proper [oss-fuzz](https://github.com/google/oss-fuzz) test coverage before adoption (@aklomp for visibility).\r\n\r\n---\r\n\r\n^ `bytes.translate`, `bytearray.translate`, or `str.translate` might benefit from similar SIMD treatment - which would be better from a CPython perspective than only doing that within this module?  If so, lets file a new issue just for that bit.\r\n\r\n---\r\n\r\n```\r\n❯ python -m pybase64 benchmark `which python`\r\npybase64 1.4.0 (C extension active - NEON)  # running on my Apple M3\r\nbench: altchars=None, validate=False\r\npybase64._pybase64.encodebytes:   4776.815 MB/s (5,936,128 bytes -\u003e 8,018,983 bytes)\r\npybase64._pybase64.b64encode:    11989.872 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\npybase64._pybase64.b64decode:     3039.329 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbase64.encodebytes:                292.876 MB/s (5,936,128 bytes -\u003e 8,018,983 bytes)\r\nbase64.b64encode:                  601.307 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\nbase64.b64decode:                  492.088 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbench: altchars=None, validate=True\r\npybase64._pybase64.b64encode:    12327.286 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\npybase64._pybase64.b64decode:     8611.733 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbase64.b64encode:                  597.389 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\nbase64.b64decode:                  472.430 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbench: altchars=b'-_', validate=False\r\npybase64._pybase64.b64encode:     1287.615 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\npybase64._pybase64.b64decode:     2524.966 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbase64.b64encode:                  473.320 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\nbase64.b64decode:                  406.411 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbench: altchars=b'-_', validate=True\r\npybase64._pybase64.b64encode:     1283.111 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\npybase64._pybase64.b64decode:     6745.809 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\nbase64.b64encode:                  464.526 MB/s (5,936,128 bytes -\u003e 7,914,840 bytes)\r\nbase64.b64decode:                  391.959 MB/s (7,914,840 bytes -\u003e 5,936,128 bytes)\r\n```\r\n\r\n### Has this already been discussed elsewhere?\r\n\r\nNo response given\r\n\r\n### Links to previous discussion of this feature:\r\n\r\nIf we spawn Discuss threads around this, lets edit and drop links here.\n\n\u003c!-- gh-linked-prs --\u003e\n### Linked PRs\n* gh-143262\n\u003c!-- /gh-linked-prs --\u003e\n","author":{"url":"https://github.com/gpshead","@type":"Person","name":"gpshead"},"datePublished":"2024-10-03T21:02:46.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":6},"url":"https://github.com/124951/cpython/issues/124951"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:82c05fb4-362f-cad0-1b0d-a197c576974a
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-idE8A2:3244AB:180C678:2050486:6969BFC2
html-safe-nonce09b54b5ea21984cf843207f76cbd6145a15efef4c953324995e659dd24c6ed8f
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFOEEyOjMyNDRBQjoxODBDNjc4OjIwNTA0ODY6Njk2OUJGQzIiLCJ2aXNpdG9yX2lkIjoiNDIwODc0OTUwMTQ5ODM3NjEzMCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmacc3f3f81467af3a3338ee7bc18fdfbdd404471a2839d6f01ac74f9aae20ef3ea8
hovercard-subject-tagissue:2564998356
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/python/cpython/124951/issue_layout
twitter:imagehttps://opengraph.githubassets.com/c0db3ddf8b54f89e01924cd68d33bbfe66f44423937ceea26d26ccccf004655a/python/cpython/issues/124951
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/c0db3ddf8b54f89e01924cd68d33bbfe66f44423937ceea26d26ccccf004655a/python/cpython/issues/124951
og:image:altPerformance enhancement Proposal: https://pypi.org/project/pybase64/ aka https://github.com/mayeut/pybase64 (BSD licensed) exists. On top of some of its own SIMD code for base64 module extra featur...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernamegpshead
hostnamegithub.com
expected-hostnamegithub.com
Noneacedec8b5f975d9e3d494ddd8f949b0b8a0de59d393901e26f73df9dcba80056
turbo-cache-controlno-preview
go-importgithub.com/python/cpython git https://github.com/python/cpython.git
octolytics-dimension-user_id1525981
octolytics-dimension-user_loginpython
octolytics-dimension-repository_id81598961
octolytics-dimension-repository_nwopython/cpython
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id81598961
octolytics-dimension-repository_network_root_nwopython/cpython
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release83c08c21cdda978090dc44364b71aa5bc6dcea79
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/python/cpython/issues/124951#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F124951
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F124951
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=python%2Fcpython
Reloadhttps://github.com/python/cpython/issues/124951
Reloadhttps://github.com/python/cpython/issues/124951
Reloadhttps://github.com/python/cpython/issues/124951
python https://github.com/python
cpythonhttps://github.com/python/cpython
Please reload this pagehttps://github.com/python/cpython/issues/124951
Notifications https://github.com/login?return_to=%2Fpython%2Fcpython
Fork 33.9k https://github.com/login?return_to=%2Fpython%2Fcpython
Star 71.1k https://github.com/login?return_to=%2Fpython%2Fcpython
Code https://github.com/python/cpython
Issues 5k+ https://github.com/python/cpython/issues
Pull requests 2.1k https://github.com/python/cpython/pulls
Actions https://github.com/python/cpython/actions
Projects 31 https://github.com/python/cpython/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/python/cpython/security
Please reload this pagehttps://github.com/python/cpython/issues/124951
Insights https://github.com/python/cpython/pulse
Code https://github.com/python/cpython
Issues https://github.com/python/cpython/issues
Pull requests https://github.com/python/cpython/pulls
Actions https://github.com/python/cpython/actions
Projects https://github.com/python/cpython/projects
Security https://github.com/python/cpython/security
Insights https://github.com/python/cpython/pulse
New issuehttps://github.com/login?return_to=https://github.com/python/cpython/issues/124951
New issuehttps://github.com/login?return_to=https://github.com/python/cpython/issues/124951
base64 module: Link against SIMD library for 10x performance.https://github.com/python/cpython/issues/124951#top
performancePerformance or resource usagehttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22performance%22
stdlibStandard Library Python modules in the Lib/ directoryhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22stdlib%22
https://github.com/gpshead
https://github.com/gpshead
gpsheadhttps://github.com/gpshead
on Oct 3, 2024https://github.com/python/cpython/issues/124951#issue-2564998356
https://pypi.org/project/pybase64/https://pypi.org/project/pybase64/
https://github.com/mayeut/pybase64https://github.com/mayeut/pybase64
https://github.com/aklomp/base64https://github.com/aklomp/base64
@mayeuthttps://github.com/mayeut
oss-fuzzhttps://github.com/google/oss-fuzz
@aklomphttps://github.com/aklomp
gh-124951: Optimize base64 encode & decode for an easy 2-3x speedup [no SIMD] #143262https://github.com/python/cpython/pull/143262
performancePerformance or resource usagehttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22performance%22
stdlibStandard Library Python modules in the Lib/ directoryhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22stdlib%22
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.