René's URL Explorer Experiment


Title: Incorrect parsing of TarInfo header when GNU long name and type AREGTYPE are combined · Issue #141707 · python/cpython · GitHub

Open Graph Title: Incorrect parsing of TarInfo header when GNU long name and type AREGTYPE are combined · Issue #141707 · python/cpython

X Title: Incorrect parsing of TarInfo header when GNU long name and type AREGTYPE are combined · Issue #141707 · python/cpython

Description: Bug report Bug description: When an entry uses GNU long name encoding the tarfile module reads in the data blocks for the name and then calls self.fromtarfile() again to get the 'actual' header. This second header is the source of truth ...

Open Graph Description: Bug report Bug description: When an entry uses GNU long name encoding the tarfile module reads in the data blocks for the name and then calls self.fromtarfile() again to get the 'actual' header. Th...

X Description: Bug report Bug description: When an entry uses GNU long name encoding the tarfile module reads in the data blocks for the name and then calls self.fromtarfile() again to get the 'actual' he...

Opengraph URL: https://github.com/python/cpython/issues/141707

X: @github

direct link

Domain: patch-diff.githubusercontent.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Incorrect parsing of TarInfo header when GNU long name  and type AREGTYPE are combined","articleBody":"# Bug report\n\n### Bug description:\n\nWhen an entry uses GNU long name encoding the `tarfile` module reads in the data blocks for the name and then [calls self.fromtarfile() again](https://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L1404) to get the 'actual' header. This second header is the source of truth for everything _except_ the name which is just garbage data.\n\nThe problem is that `fromtarfile()` eventually calls `frombuf()` where [this logic](https://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L1310-L1311) incorrectly uses the garbage data and overrides the entry type to directory, corrupting the entry.\n\nBecause the entry is detected as a directory, the offset is not updated properly and the next call to read a TarInfo entry will usually result in an exception. However, the exception lands up in [this block](https://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L2851-L2857) where neither of the `if` conditions are met, so the exception is silently discarded. `tarinfo` remains `None` and the code [eventually decides](https://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L2881-L2882) that there are no more entries in the tar file.\n\nI initially ran into this issue due to reports of invalid sdists being generated by maturin.\nSee: https://github.com/PyO3/maturin/issues/2855\n\n### CPython versions tested on:\n\n3.9, 3.10, 3.11, 3.12, 3.13, 3.14\n\n### Operating systems tested on:\n\nmacOS, Linux\n\n\u003c!-- gh-linked-prs --\u003e\n### Linked PRs\n* gh-143157\n* gh-143934\n\u003c!-- /gh-linked-prs --\u003e\n","author":{"url":"https://github.com/e-nomem","@type":"Person","name":"e-nomem"},"datePublished":"2025-11-18T10:34:31.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":1},"url":"https://github.com/141707/cpython/issues/141707"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:3ac5c97e-f280-d525-c424-f746a30a31f6
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-idC4DA:3C8A20:899DD1F:B7B55E7:696E1833
html-safe-nonceb7d09b9a721458371f0ac2661702857036db6ec7be8286add0768a4ec49915b0
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDNERBOjNDOEEyMDo4OTlERDFGOkI3QjU1RTc6Njk2RTE4MzMiLCJ2aXNpdG9yX2lkIjoiNDIzMTk1NzMyMjkyMjg1ODU0NyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac030a450f454ea8764eaab7642f9d81a692877498d0286d9a675ce9c9dbce5b57
hovercard-subject-tagissue:3637381854
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/python/cpython/141707/issue_layout
twitter:imagehttps://opengraph.githubassets.com/6a80d203aec9cb844f5dbf7dacfe4dbbb3cbeb7102f23f2297866518b9cdd141/python/cpython/issues/141707
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/6a80d203aec9cb844f5dbf7dacfe4dbbb3cbeb7102f23f2297866518b9cdd141/python/cpython/issues/141707
og:image:altBug report Bug description: When an entry uses GNU long name encoding the tarfile module reads in the data blocks for the name and then calls self.fromtarfile() again to get the 'actual' header. Th...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernamee-nomem
hostnamegithub.com
expected-hostnamegithub.com
None1a7d6d739bf034e67486b9f97a31887ca30302b72a0acac49b6bcddff34356d7
turbo-cache-controlno-preview
go-importgithub.com/python/cpython git https://github.com/python/cpython.git
octolytics-dimension-user_id1525981
octolytics-dimension-user_loginpython
octolytics-dimension-repository_id81598961
octolytics-dimension-repository_nwopython/cpython
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id81598961
octolytics-dimension-repository_network_root_nwopython/cpython
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release87d7872ec7094ed247923539669aabda9230966f
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/python/cpython/issues/141707#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F141707
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython%2Fcpython%2Fissues%2F141707
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=python%2Fcpython
Reloadhttps://patch-diff.githubusercontent.com/python/cpython/issues/141707
Reloadhttps://patch-diff.githubusercontent.com/python/cpython/issues/141707
Reloadhttps://patch-diff.githubusercontent.com/python/cpython/issues/141707
python https://patch-diff.githubusercontent.com/python
cpythonhttps://patch-diff.githubusercontent.com/python/cpython
Please reload this pagehttps://patch-diff.githubusercontent.com/python/cpython/issues/141707
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fpython%2Fcpython
Fork 33.9k https://patch-diff.githubusercontent.com/login?return_to=%2Fpython%2Fcpython
Star 71.1k https://patch-diff.githubusercontent.com/login?return_to=%2Fpython%2Fcpython
Code https://patch-diff.githubusercontent.com/python/cpython
Issues 5k+ https://patch-diff.githubusercontent.com/python/cpython/issues
Pull requests 2.1k https://patch-diff.githubusercontent.com/python/cpython/pulls
Actions https://patch-diff.githubusercontent.com/python/cpython/actions
Projects 31 https://patch-diff.githubusercontent.com/python/cpython/projects
Security Uh oh! There was an error while loading. Please reload this page. https://patch-diff.githubusercontent.com/python/cpython/security
Please reload this pagehttps://patch-diff.githubusercontent.com/python/cpython/issues/141707
Insights https://patch-diff.githubusercontent.com/python/cpython/pulse
Code https://patch-diff.githubusercontent.com/python/cpython
Issues https://patch-diff.githubusercontent.com/python/cpython/issues
Pull requests https://patch-diff.githubusercontent.com/python/cpython/pulls
Actions https://patch-diff.githubusercontent.com/python/cpython/actions
Projects https://patch-diff.githubusercontent.com/python/cpython/projects
Security https://patch-diff.githubusercontent.com/python/cpython/security
Insights https://patch-diff.githubusercontent.com/python/cpython/pulse
New issuehttps://patch-diff.githubusercontent.com/login?return_to=https://github.com/python/cpython/issues/141707
New issuehttps://patch-diff.githubusercontent.com/login?return_to=https://github.com/python/cpython/issues/141707
#143157https://github.com/python/cpython/pull/143157
Incorrect parsing of TarInfo header when GNU long name and type AREGTYPE are combinedhttps://patch-diff.githubusercontent.com/python/cpython/issues/141707#top
#143157https://github.com/python/cpython/pull/143157
stdlibStandard Library Python modules in the Lib/ directoryhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22stdlib%22
type-bugAn unexpected behavior, bug, or errorhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22type-bug%22
type-securityA security issuehttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22type-security%22
https://github.com/e-nomem
https://github.com/e-nomem
e-nomemhttps://github.com/e-nomem
on Nov 18, 2025https://github.com/python/cpython/issues/141707#issue-3637381854
calls self.fromtarfile() againhttps://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L1404
this logichttps://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L1310-L1311
this blockhttps://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L2851-L2857
eventually decideshttps://github.com/python/cpython/blob/4867f717e21c3b5f0ad0e81f950c69dac6c95e6e/Lib/tarfile.py#L2881-L2882
PyO3/maturin#2855https://github.com/PyO3/maturin/issues/2855
gh-141707: Fix tarfile type corruption with GNU long names #143157https://github.com/python/cpython/pull/143157
gh-141707: Skip TarInfo DIRTYPE normalization during GNU long name ha… #143934https://github.com/python/cpython/pull/143934
stdlibStandard Library Python modules in the Lib/ directoryhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22stdlib%22
type-bugAn unexpected behavior, bug, or errorhttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22type-bug%22
type-securityA security issuehttps://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22type-security%22
Tarfile issueshttps://github.com/orgs/python/projects/11
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.