René's URL Explorer Experiment


Title: Dataclasses - Improve the performance of asdict/astuple for common types and default values · Issue #103000 · python/cpython · GitHub

Description: Feature or enhancement Improve the performance of asdict/astuple in common cases by making a shortcut for common types that are unaffected by deepcopy in the inner loop. Also special casing for the default dict_factory=dict to construct ...

Opengraph URL: https://github.com/python/cpython/issues/103000

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Dataclasses - Improve the performance of asdict/astuple for common types and default values","articleBody":"# Feature or enhancement\r\n\r\nImprove the performance of asdict/astuple in common cases by making a shortcut for common types that are unaffected by deepcopy in the inner loop. Also special casing for the default `dict_factory=dict` to construct the dictionary directly.\r\n\r\nThe goal here is to improve performance in common cases without significantly impacting less common cases, while not changing the API or output in any way.\r\n\r\n# Pitch\r\n\r\nIn cases where a dataclass contains a lot of data of common Python types (e.g. bool/str/int/float) currently the inner loops for `asdict` and `astuple` require the values to be compared to check if they are dataclasses, namedtuples, lists, tuples, and then dictionaries before passing them to `deepcopy`. This proposes to special case and shortcut objects of types where `deepcopy` returns the object unchanged.\r\n\r\nIt is much faster for these cases to instead check for them at the first opportunity and shortcut their return, skipping the recursive call and all of the other comparisons. In the case where this is being used to prepare an object to serialize to JSON this can be quite significant as this covers most of the remaining types handled by the stdlib `json` module.\r\n\r\nNote: Anything that skips deepcopy with this alteration is already unchanged as `deepcopy(obj) is obj` is always True for these types.\r\n\r\nCurrently when constructing the `dict` for a dataclass, a list of tuples is created and passed to the `dict_factory` constructor. 
In the case where the `dict_factory` constructor is the default - `dict` - it is faster to construct the dictionary directly.\r\n\r\n# Previous discussion\r\n\r\nDiscussed here with a few more details and earlier examples: https://discuss.python.org/t/dataclasses-make-asdict-astuple-faster-by-skipping-deepcopy-for-objects-where-deepcopy-obj-is-obj/24662\r\n\r\n# Code Details\r\n## Types to skip deepcopy\r\n\r\nThis is the current set of types to be checked for and shortcut returned, ordered in a way that I think makes more sense for `dataclasses` than the original ordering copied from the `copy` module. These are known to be safe to skip as they are all sent to `_deepcopy_atomic` (which returns the original object) in the `copy` module. \r\n\r\n```python\r\n# Types for which deepcopy(obj) is known to return obj unmodified\r\n# Used to skip deepcopy in asdict and astuple for performance\r\n_ATOMIC_TYPES = {\r\n    # Common JSON Serializable types\r\n    types.NoneType,\r\n    bool,\r\n    int,\r\n    float,\r\n    complex,\r\n    bytes,\r\n    str,\r\n    # Other types that are also unaffected by deepcopy\r\n    types.EllipsisType,\r\n    types.NotImplementedType,\r\n    types.CodeType,\r\n    types.BuiltinFunctionType,\r\n    types.FunctionType,\r\n    type,\r\n    range,\r\n    property,\r\n    # weakref.ref,  # weakref is not currently imported by dataclasses directly\r\n}\r\n```\r\n\r\n## Function changes\r\n\r\nWith that added the change is essentially replacing each instance of\r\n\r\n```python\r\n_asdict_inner(v, dict_factory)\r\n```\r\n\r\ninside `_asdict_inner`, with\r\n\r\n```python\r\nv if type(v) in _ATOMIC_TYPES else _asdict_inner(v, dict_factory)\r\n```\r\n\r\nInstances of subclasses of these types are not guaranteed to have `deepcopy(obj) is obj` so this checks specifically for instances of the base types.\r\n\r\n# Performance tests\r\n\r\nTest file: https://gist.github.com/DavidCEllis/a2c2ceeeeda2d1ac509fb8877e5fb60d\r\n\r\nResults on my development 
machine (not a perfectly stable test machine, but these differences are large enough).\r\n\r\n## Main\r\n\r\nCurrent Main python branch:\r\n```\r\nDataclasses asdict/astuple speed tests\r\n--------------------------------------\r\nPython v3.12.0alpha6\r\nGIT branch: main\r\nTest Iterations: 10000\r\nList of Int case asdict: 5.80s\r\n\r\nTest Iterations: 1000\r\nList of Decimal case asdict: 0.65s\r\n\r\nTest Iterations: 1000000\r\nBasic types case asdict: 3.76s\r\nBasic types astuple: 3.48s\r\n\r\nTest Iterations: 100000\r\nOpaque types asdict: 2.15s\r\nOpaque types astuple: 2.11s\r\n\r\nTest Iterations: 100\r\nMixed containers asdict: 3.66s\r\nMixed containers astuple: 3.28s\r\n```\r\n\r\n## Modified\r\n\r\n[Modified Branch](https://github.com/DavidCEllis/cpython/blob/faster_dataclasses_serialize/Lib/dataclasses.py):\r\n\r\n```\r\nDataclasses asdict/astuple speed tests\r\n--------------------------------------\r\nPython v3.12.0alpha6\r\nGIT branch: faster_dataclasses_serialize\r\nTest Iterations: 10000\r\nList of Int case asdict: 0.53s\r\n\r\nTest Iterations: 1000\r\nList of Decimal case asdict: 0.68s\r\n\r\nTest Iterations: 1000000\r\nBasic types case asdict: 1.33s\r\nBasic types astuple: 1.28s\r\n\r\nTest Iterations: 100000\r\nOpaque types asdict: 2.14s\r\nOpaque types astuple: 2.13s\r\n\r\nTest Iterations: 100\r\nMixed containers asdict: 1.99s\r\nMixed containers astuple: 1.84s\r\n```\n\n\u003c!-- gh-linked-prs --\u003e\n### Linked PRs\n* gh-103005\n* gh-104364\n\u003c!-- /gh-linked-prs --\u003e\n","author":{"url":"https://github.com/DavidCEllis","@type":"Person","name":"DavidCEllis"},"datePublished":"2023-03-24T12:09:49.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":5},"url":"https://github.com/python/cpython/issues/103000"}


Links:

Dataclasses - Improve the performance of asdict/astuple for common types and default values: https://github.com/python/cpython/issues/103000#top
3.12 (only security fixes): https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%223.12%22
performance (Performance or resource usage): https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22performance%22
stdlib (Standard Library Python modules in the Lib/ directory): https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22stdlib%22
type-feature (A feature request or enhancement): https://github.com/python/cpython/issues?q=state%3Aopen%20label%3A%22type-feature%22
DavidCEllis: https://github.com/DavidCEllis
on Mar 24, 2023: https://github.com/python/cpython/issues/103000#issue-1639276455
https://discuss.python.org/t/dataclasses-make-asdict-astuple-faster-by-skipping-deepcopy-for-objects-where-deepcopy-obj-is-obj/24662
https://gist.github.com/DavidCEllis/a2c2ceeeeda2d1ac509fb8877e5fb60d
Modified Branch: https://github.com/DavidCEllis/cpython/blob/faster_dataclasses_serialize/Lib/dataclasses.py
gh-103000: Optimise dataclasses asdict/astuple for common types #103005: https://github.com/python/cpython/pull/103005
gh-103000: Optimise dataclasses.asdict for the common case #104364: https://github.com/python/cpython/pull/104364
