René's URL Explorer Experiment


Title: Unable to read docx containing pictures linking to internal bookmarks: KeyError: "There is no item named 'word/#MyBookmark' in the archive" · Issue #902 · python-openxml/python-docx · GitHub

Open Graph Title: Unable to read docx containing pictures linking to internal bookmarks: KeyError: "There is no item named 'word/#MyBookmark' in the archive" · Issue #902 · python-openxml/python-docx

X Title: Unable to read docx containing pictures linking to internal bookmarks: KeyError: "There is no item named 'word/#MyBookmark' in the archive" · Issue #902 · python-openxml/python-docx

Description: The document below contains a picture with a hyperlink to an internal bookmark. PictureBookmarks.docx (The very last picture links to the very first Heading1) I get this error message when reading the file using python-docx: KeyError: "T...

Open Graph Description: The document below contains a picture with a hyperlink to an internal bookmark. PictureBookmarks.docx (The very last picture links to the very first Heading1) I get this error message when reading ...

X Description: The document below contains a picture with a hyperlink to an internal bookmark. PictureBookmarks.docx (The very last picture links to the very first Heading1) I get this error message when reading ...

Opengraph URL: https://github.com/python-openxml/python-docx/issues/902

X: @github

direct link

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Unable to read docx containing pictures linking to internal bookmarks: KeyError: \"There is no item named 'word/#MyBookmark' in the archive\"","articleBody":"The document below contains a picture with a hyperlink to an internal bookmark.\r\n\r\n[PictureBookmarks.docx](https://github.com/python-openxml/python-docx/files/5638403/PictureBookmarks.docx) (The very last picture links to the very first Heading1)\r\n\r\nI get this error message when reading the file using python-docx: `KeyError: \"There is no item named 'word/#MyBookmark' in the archive\"`\r\n\r\n## Stack trace\r\n```\r\n#  Command\r\nself.document = Document(document)\r\n\r\n#  Trace (from pytest)\r\n..\\..\\..\\..\\..\\python-docx\\docx\\api.py:32: in Document\r\n    document_part = Package.open(docx).main_document_part\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\package.py:117: in open\r\n    pkg_reader = PackageReader.from_file(pkg_file)\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\pkgreader.py:37: in from_file\r\n    phys_reader, pkg_srels, content_types\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\pkgreader.py:74: in _load_serialized_parts\r\n    for partname, blob, reltype, srels in part_walker:\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\pkgreader.py:119: in _walk_phys_parts\r\n    for partname, blob, reltype, srels in next_walker:\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\pkgreader.py:114: in _walk_phys_parts\r\n    blob = phys_reader.blob_for(partname)\r\n..\\..\\..\\..\\..\\python-docx\\docx\\opc\\phys_pkg.py:109: in blob_for\r\n    return self._zipf.read(pack_uri.membername)\r\nC:\\...\\lib\\zipfile.py:1337: in read\r\n    with self.open(name, \"r\", pwd) as fp:\r\nC:\\...\\lib\\zipfile.py:1375: in open\r\n    zinfo = self.getinfo(name)\r\n_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ \r\nself = \u003czipfile.ZipFile filename='C:\\\\...\\\\PictureBookmarks.docx' mode='r'\u003e, name = 'word/#MyBookmark'\r\n\r\n    def getinfo(self, name):\r\n        \"\"\"Return the instance of ZipInfo given 'name'.\"\"\"\r\n        info = self.NameToInfo.get(name)\r\n        if info is None:\r\n            raise KeyError(\r\n\u003e               'There is no item named %r in the archive' % name)\r\nE           KeyError: \"There is no item named 'word/#MyBookmark' in the archive\"\r\n```\r\n\r\n# How to recreate\r\nThis is achieved by:\r\n1. Adding a picture to the Word file\r\n2. Right-click =\u003e Link\r\n3. Add link to any internal bookmark.\r\n\r\nThen the hyperlink ends up like this, notice the `a:hlinkClick` relationship ID:\r\n\r\n```\r\n                \u003cw:drawing\u003e\r\n                    \u003cwp:inline distT=\"0\" distB=\"0\" distL=\"0\" distR=\"0\" wp14:anchorId=\"2A30E332\" wp14:editId=\"3F2B5F70\"\u003e\r\n                       (...)\r\n                        \u003ca:graphic xmlns:a=\"http://schemas.openxmlformats.org/drawingml/2006/main\"\u003e\r\n                            \u003ca:graphicData uri=\"http://schemas.openxmlformats.org/drawingml/2006/picture\"\u003e\r\n                                \u003cpic:pic xmlns:pic=\"http://schemas.openxmlformats.org/drawingml/2006/picture\"\u003e\r\n                                    \u003cpic:nvPicPr\u003e\r\n                                        \u003cpic:cNvPr id=\"28\" name=\"Picture 28\" descr=\"(...)\"\u003e\r\n                                            \u003ca:hlinkClick r:id=\"rId21\"/\u003e\r\n                                        \u003c/pic:cNvPr\u003e\r\n                                        \u003cpic:cNvPicPr/\u003e\r\n                                    \u003c/pic:nvPicPr\u003e\r\n                                    \u003cpic:blipFill\u003e\r\n                                        (...)\r\n                                    \u003c/pic:blipFill\u003e\r\n                                    (...)\r\n                                \u003c/pic:pic\u003e\r\n                            \u003c/a:graphicData\u003e\r\n                        \u003c/a:graphic\u003e\r\n                    \u003c/wp:inline\u003e\r\n                \u003c/w:drawing\u003e\r\n```\r\n\r\nNow, in `word/_rels/document.xml.rels`, we get:\r\n\r\n```\r\n    \u003cRelationship Id=\"rId21\" Type=\"http://schemas.openxmlformats.org/officeDocument/2006/relationships/hyperlink\" Target=\"#MyBookmark\"/\u003e\r\n```\r\n\r\nThis item bugs python-docx for me. I'll admit I'm using a 2.5-year-old version of the package, since I needed to modify stuff for my own usecase, so I am not sure whether this has been fixed after that. I was looking for whether this had been solved somehow, and it seems it is very much related to this issue.\r\n\r\n# Investigation\r\nI see in the `pkgreader` that the `target_mode` can be used to identify external targets, and that external targets receive special treatment to avoid such zipfile issues. External targets are recognized in the relationship file for e.g. hyperlinks to web sites, and add a `Target` attribute to the `\u003cRelationship\u003e` object.\r\n\r\nFrom what I gather, `RT.HYPERLINK` elements that have a `Target` starting with `#` should be treated specially - like some sort of internal bookmark relationship (or similar).","author":{"url":"https://github.com/aorsten","@type":"Person","name":"aorsten"},"datePublished":"2020-12-03T19:11:26.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":6},"url":"https://github.com/902/python-docx/issues/902"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:b8050fe0-3867-ff8c-5494-89bef60e7105
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-idEC0C:35E3DA:2D9D053:3C0B653:696DB1A2
html-safe-nonce72278b3baf3cca2c9038a53669f29a26e66555e6212074bbba91296f824974f3
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFQzBDOjM1RTNEQToyRDlEMDUzOjNDMEI2NTM6Njk2REIxQTIiLCJ2aXNpdG9yX2lkIjoiNzU0NTA4MDE2MTQ1NTk0NDA5OCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmacfa1068a72f5c46f667ef44d0b8a61cfab42783b7065b52405d8831df6f45373a
hovercard-subject-tagissue:756491255
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/python-openxml/python-docx/902/issue_layout
twitter:imagehttps://opengraph.githubassets.com/2788a4ffff3b1d39892d584a572950d3b6e1aef18e7bce5f704dd634d9c238a9/python-openxml/python-docx/issues/902
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/2788a4ffff3b1d39892d584a572950d3b6e1aef18e7bce5f704dd634d9c238a9/python-openxml/python-docx/issues/902
og:image:altThe document below contains a picture with a hyperlink to an internal bookmark. PictureBookmarks.docx (The very last picture links to the very first Heading1) I get this error message when reading ...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernameaorsten
hostnamegithub.com
expected-hostnamegithub.com
None4922b452d03cd8dbce479d866a11bc25b59ef6ee2da23aa9b0ddefa6bd4d0064
turbo-cache-controlno-preview
go-importgithub.com/python-openxml/python-docx git https://github.com/python-openxml/python-docx.git
octolytics-dimension-user_id3403760
octolytics-dimension-user_loginpython-openxml
octolytics-dimension-repository_id13592924
octolytics-dimension-repository_nwopython-openxml/python-docx
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id13592924
octolytics-dimension-repository_network_root_nwopython-openxml/python-docx
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release7e5ae23c70136152637ceee8d6faceb35596ec46
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/python-openxml/python-docx/issues/902#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython-openxml%2Fpython-docx%2Fissues%2F902
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fpython-openxml%2Fpython-docx%2Fissues%2F902
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=python-openxml%2Fpython-docx
Reloadhttps://github.com/python-openxml/python-docx/issues/902
Reloadhttps://github.com/python-openxml/python-docx/issues/902
Reloadhttps://github.com/python-openxml/python-docx/issues/902
python-openxml https://github.com/python-openxml
python-docxhttps://github.com/python-openxml/python-docx
Notifications https://github.com/login?return_to=%2Fpython-openxml%2Fpython-docx
Fork 1.3k https://github.com/login?return_to=%2Fpython-openxml%2Fpython-docx
Star 5.4k https://github.com/login?return_to=%2Fpython-openxml%2Fpython-docx
Code https://github.com/python-openxml/python-docx
Issues 577 https://github.com/python-openxml/python-docx/issues
Pull requests 121 https://github.com/python-openxml/python-docx/pulls
Actions https://github.com/python-openxml/python-docx/actions
Projects 0 https://github.com/python-openxml/python-docx/projects
Wiki https://github.com/python-openxml/python-docx/wiki
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/python-openxml/python-docx/security
Please reload this pagehttps://github.com/python-openxml/python-docx/issues/902
Insights https://github.com/python-openxml/python-docx/pulse
Code https://github.com/python-openxml/python-docx
Issues https://github.com/python-openxml/python-docx/issues
Pull requests https://github.com/python-openxml/python-docx/pulls
Actions https://github.com/python-openxml/python-docx/actions
Projects https://github.com/python-openxml/python-docx/projects
Wiki https://github.com/python-openxml/python-docx/wiki
Security https://github.com/python-openxml/python-docx/security
Insights https://github.com/python-openxml/python-docx/pulse
New issuehttps://github.com/login?return_to=https://github.com/python-openxml/python-docx/issues/902
New issuehttps://github.com/login?return_to=https://github.com/python-openxml/python-docx/issues/902
Unable to read docx containing pictures linking to internal bookmarks: KeyError: "There is no item named 'word/#MyBookmark' in the archive"https://github.com/python-openxml/python-docx/issues/902#top
shortlisthttps://github.com/python-openxml/python-docx/issues?q=state%3Aopen%20label%3A%22shortlist%22
https://github.com/aorsten
https://github.com/aorsten
aorstenhttps://github.com/aorsten
on Dec 3, 2020https://github.com/python-openxml/python-docx/issues/902#issue-756491255
PictureBookmarks.docxhttps://github.com/python-openxml/python-docx/files/5638403/PictureBookmarks.docx
shortlisthttps://github.com/python-openxml/python-docx/issues?q=state%3Aopen%20label%3A%22shortlist%22
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.