René's URL Explorer Experiment


Title: Bug: `binascii.a2b_uu` incorrectly assumes padded bytes are always whitespace · Issue #100308 · python/cpython · GitHub

Open Graph Title: Bug: `binascii.a2b_uu` incorrectly assumes padded bytes are always whitespace · Issue #100308 · python/cpython

X Title: Bug: `binascii.a2b_uu` incorrectly assumes padded bytes are always whitespace · Issue #100308 · python/cpython

Description: Bug Description I was decoding some UUEncoded data when I encountered a 'Trailing Garbage' error from the binascii.a2b_uu function. After digging into Linux's uu decode implementation(L248) and other resources (linked below) I'm decently...

Open Graph Description: Bug Description I was decoding some UUEncoded data when I encountered a 'Trailing Garbage' error from the binascii.a2b_uu function. After digging into Linux's uu decode implementation(L248) and oth...

X Description: Bug Description I was decoding some UUEncoded data when I encountered a 'Trailing Garbage' error from the binascii.a2b_uu function. After digging into Linux's uu decode implementation(L...

Opengraph URL: https://github.com/python/cpython/issues/100308

X: @github

direct link

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Bug: `binascii.a2b_uu` incorrectly assumes padded bytes are always whitespace","articleBody":"### Bug Description\r\nI was decoding some UUEncoded data when I encountered a 'Trailing Garbage' error from the `binascii.a2b_uu` function. After digging into [Linux's uu decode implementation](https://fossies.org/linux/uuencode/uudecode.c)(L248) and other resources (linked below) I'm decently certain the python implementation is bugged.\r\n\r\n### The following is what I tried:\r\n```python\r\nfrom binascii import a2b_uu\r\ns = '%-@     !'\r\ndecoded = a2b_uu(s)\r\n```\r\n### The expected output is:\r\n```python\r\nprint(decoded)  # b'6\\x00\\x00\\x00\\x00'\r\n```\r\n### The actual output is:\r\n```text\r\nTraceback (most recent call last):\r\n  File \"\u003cstdin\u003e\", line 1, in \u003cmodule\u003e\r\nbinascii.Error: Trailing garbage\r\n```\r\n\r\nNotice there are 5 bytes in the expected output (b'6\\x00\\x00\\x00\\x00') because the `%` (first byte of input string, `s`) means 5 bytes of data follow (ascii code 37 - 32 = 5). UUEncoding requires output be divisible by 3 bytes so an extra padding character is added. In this case it's an `!`.\r\n\r\nThe python implementation assumes the padding is always whitespace. Different uuencoders will use different characters for padding though. I've seen three so far: ` `, `` ` ``, and `!`.\r\n\r\n[The following several lines of code are the issue](https://github.com/python/cpython/blob/main/Modules/binascii.c#L280)\r\n\r\n### Proposed fix\r\nSimply remove the following lines (279 - 296). Or if we really want the verification of padding we can include the '!' in the condition of valid padding chars. (The linked linux implementation does not verify padding, however.) And based on my research, there isn't a well defined padding character so we will be jumping to the same potentially false conclusion that we have here: believing we've accounted for all the padding characters that exist in the wild.\r\n```c\r\n/*\r\n** Finally, check that if there's anything left on the line\r\n** that it's whitespace only.\r\n*/\r\nwhile( ascii_len-- \u003e 0 ) {\r\n    this_ch = *ascii_data++;\r\n    /* Extra '`' may be written as padding in some cases */\r\n    if ( this_ch != ' ' \u0026\u0026 this_ch != ' '+64 \u0026\u0026\r\n         this_ch != '\\n' \u0026\u0026 this_ch != '\\r' ) {\r\n        state = get_binascii_state(module);\r\n        if (state == NULL) {\r\n            return NULL;\r\n        }\r\n        PyErr_SetString(state-\u003eError, \"Trailing garbage\");\r\n        Py_DECREF(rv);\r\n        return NULL;\r\n    }\r\n}\r\n```\r\n\r\nProblematically, this bug propagated up to the uu_codec decode implementation as well. [See the following code](https://github.com/python/cpython/blob/main/Lib/encodings/uu_codec.py#L60)\r\n\r\nA comment indicates the caught exception and \"workaround\" are due to broken uuencoders. According to what I've read, it's the broken python binascii.a2b_uu that incorrectly assumes any padding bytes are ` ` or `` ` ``.\r\n\r\nHere are the sources for my understanding of uu encoding:\r\n[Examples of non whitespace padding](https://www.herongyang.com/Encoding/UUEncode-Algorithm.html)\r\n[Wikipedia uuencoding](https://en.wikipedia.org/wiki/Uuencoding)\r\n[Busybox uudecode implementation](https://elixir.bootlin.com/busybox/0.45/source/uudecode.c#L67)\r\n\r\nFollowing is an illustration that helped me find a sense of understanding:\r\n![uuencode-bug-explanation](https://user-images.githubusercontent.com/16959700/208971943-c4de3c28-77c4-43e5-9ae9-806ee7b750e5.png)\r\n\r\n[1] I couldn't find an RFC or other standards document so I looked for the earliest implementation I could find (1983 Linux implementation) along with the wikipedia entry.\r\n\r\n### In the meantime\r\nIf others encounter this issue I'm using the following workaround:\r\n```python\r\nimport binascii\r\nfrom binascii import a2b_uu\r\nfrom io import BytesIO\r\n\r\nmy_bytes = BytesIO()\r\nline_bytes = b'%-@     !'\r\nline = line_bytes.decode(encoding='ascii')\r\ntry:\r\n    my_bytes.write(a2b_uu(line))\r\nexcept binascii.Error as err:\r\n    if 'trailing garbage' in str(err).lower():\r\n        n_bytes = line_bytes[0] - 32\r\n        assert n_bytes \u003c= 45 and n_bytes \u003c= len(line[1:])\r\n        workaround_line = f'M{line[1:]}'  # replace first byte of UUEncoded line with max length specifier (M)\r\n        data = a2b_uu(workaround_line)[:n_bytes]\r\n        my_bytes.write(data)\r\n    else:\r\n        raise err\r\n```","author":{"url":"https://github.com/ajmedeio","@type":"Person","name":"ajmedeio"},"datePublished":"2022-12-16T20:52:31.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":5},"url":"https://github.com/100308/cpython/issues/100308"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:2144cca0-9a50-f2d1-ef35-ee2f4ae20a24
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-idD64A:1D772C:227B3BC:2D86257:696B1483
html-safe-nonce6ffa5550ca832a5e3b0592cb281d307b9bdb243885e13b84c67143451db421c5
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJENjRBOjFENzcyQzoyMjdCM0JDOjJEODYyNTc6Njk2QjE0ODMiLCJ2aXNpdG9yX2lkIjoiNjI0NjI0MjcwOTI0NDk0MTQ0MyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac6414a12f4ad8d85a745c073216e92ea3274e823d0d3479bf679fe240fcf114d4
hovercard-subject-tagissue:1500849960
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/python/cpython/100308/issue_layout
twitter:imagehttps://opengraph.githubassets.com/02f88ca3b4213efa5700aff20ed614c9a97d0d4011e90215a4d197efbd976df1/python/cpython/issues/100308
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/02f88ca3b4213efa5700aff20ed614c9a97d0d4011e90215a4d197efbd976df1/python/cpython/issues/100308
og:image:altBug Description I was decoding some UUEncoded data when I encountered a 'Trailing Garbage' error from the binascii.a2b_uu function. After digging into Linux's uu decode implementation(L248) and oth...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernameajmedeio
hostnamegithub.com
expected-hostnamegithub.com
None5f99f7c1d70f01da5b93e5ca90303359738944d8ab470e396496262c66e60b8d
turbo-cache-controlno-preview
go-importgithub.com/python/cpython git https://github.com/python/cpython.git
octolytics-dimension-user_id1525981
octolytics-dimension-user_loginpython
octolytics-dimension-repository_id81598961
octolytics-dimension-repository_nwopython/cpython
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id81598961
octolytics-dimension-repository_network_root_nwopython/cpython
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release82560a55c6b2054555076f46e683151ee28a19bc
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/python/cpython/issues/100308#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F100308
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F100308
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=python%2Fcpython
Reloadhttps://github.com/python/cpython/issues/100308
Reloadhttps://github.com/python/cpython/issues/100308
Reloadhttps://github.com/python/cpython/issues/100308
python https://github.com/python
cpythonhttps://github.com/python/cpython
Please reload this pagehttps://github.com/python/cpython/issues/100308
Notifications https://github.com/login?return_to=%2Fpython%2Fcpython
Fork 33.9k https://github.com/login?return_to=%2Fpython%2Fcpython
Star 71.1k https://github.com/login?return_to=%2Fpython%2Fcpython
Code https://github.com/python/cpython
Issues 5k+ https://github.com/python/cpython/issues
Pull requests 2.1k https://github.com/python/cpython/pulls
Actions https://github.com/python/cpython/actions
Projects 31 https://github.com/python/cpython/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/python/cpython/security
Please reload this pagehttps://github.com/python/cpython/issues/100308
Insights https://github.com/python/cpython/pulse
Code https://github.com/python/cpython
Issues https://github.com/python/cpython/issues
Pull requests https://github.com/python/cpython/pulls
Actions https://github.com/python/cpython/actions
Projects https://github.com/python/cpython/projects
Security https://github.com/python/cpython/security
Insights https://github.com/python/cpython/pulse
New issuehttps://github.com/login?return_to=https://github.com/python/cpython/issues/100308
New issuehttps://github.com/login?return_to=https://github.com/python/cpython/issues/100308
Bug: binascii.a2b_uu incorrectly assumes padded bytes are always whitespacehttps://github.com/python/cpython/issues/100308#top
extension-modulesC modules in the Modules dirhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22extension-modules%22
https://github.com/ajmedeio
https://github.com/ajmedeio
ajmedeiohttps://github.com/ajmedeio
on Dec 16, 2022https://github.com/python/cpython/issues/100308#issue-1500849960
Linux's uu decode implementationhttps://fossies.org/linux/uuencode/uudecode.c
The following several lines of code are the issuehttps://github.com/python/cpython/blob/main/Modules/binascii.c#L280
See the following codehttps://github.com/python/cpython/blob/main/Lib/encodings/uu_codec.py#L60
Examples of non whitespace paddinghttps://www.herongyang.com/Encoding/UUEncode-Algorithm.html
Wikipedia uuencodinghttps://en.wikipedia.org/wiki/Uuencoding
Busybox uudecode implementationhttps://elixir.bootlin.com/busybox/0.45/source/uudecode.c#L67
https://user-images.githubusercontent.com/16959700/208971943-c4de3c28-77c4-43e5-9ae9-806ee7b750e5.png
extension-modulesC modules in the Modules dirhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22extension-modules%22
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.