René's URL Explorer Experiment


Title: Seeking advice: remove is painfully slow · Issue #25 · objectbox/objectbox-python · GitHub

Description: I'm finding that adding and searching an objectbox database is really fast. However, the remove operation is really slow (1 second per object.) The database is on a local NVME SSD drive. It contains about 20,000 hashes and takes about 6G...

Opengraph URL: https://github.com/objectbox/objectbox-python/issues/25

Domain: patch-diff.githubusercontent.com


JSON-LD script found on the page:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Seeking advice: remove is painfully slow","articleBody":"I'm finding that adding and searching an objectbox database is really fast. However, the remove operation is really slow (1 second per object.) The database is on a local NVME SSD drive. It contains about 20,000 hashes and takes about 6GB.\r\n\r\nMy find_unique hash_box.query operation is fast - it's literally the call to hash_box.remove that takes the time.\r\n\r\nWhat am I doing wrong?\r\n\r\n``` python\r\n@Entity()\r\nclass ImHash:\r\n    id = Id\r\n    key = String(index=Index(IndexType.HASH), unique=True)\r\n    cos_value = Float32Vector(index=HnswIndex(\r\n        dimensions=62720,\r\n        distance_type=VectorDistanceType.COSINE,\r\n    ))\r\n\r\n\r\ndef hash_image(im: Image.Image) -\u003e list[float]:\r\n    vector = img2vec.get_vec(im, tensor=True)\r\n    return vector.detach().cpu().numpy().flatten()\r\n\r\n\r\ndef hash_and_store(name_or_fp, key: str):\r\n    im = Image.open(name_or_fp)\r\n    h = hash_image(im)\r\n    ih = find_unique(key)\r\n    if ih is None:\r\n        # create\r\n        ih = ImHash()\r\n        ih.key = key\r\n    ih.cos_value = h\r\n    with store_lock:\r\n        hash_box.put(ih)\r\n\r\n\r\ndef init(db_dir: pathlib.Path):\r\n    global store, hash_box, img2vec\r\n    store = Store(directory=str(db_dir / directory_name),\r\n                  model_json_file=str(db_dir / json_model_name),\r\n                  max_db_size_in_kb=10 * 1024 * 1024)\r\n    hash_box = store.box(ImHash)\r\n    img2vec = Img2Vec(cuda=False, model='efficientnet_b0')\r\n\r\n\r\ndef close():\r\n    store.close()\r\n\r\n\r\ndef find_unique(key: str):\r\n    with store_lock:\r\n        query = hash_box.query(ImHash.key.equals(key)).build()\r\n        result = query.find()\r\n    if len(result) == 0:\r\n        return None\r\n    elif len(result) \u003e 1:\r\n        print('Multiple matches found')\r\n        return None\r\n    
else:\r\n        return result[0]\r\n\r\n\r\ndef find_similar(key: str) -\u003e list[tuple[ImHash, float]]:\r\n    target = find_unique(key)\r\n    with store_lock:\r\n        query = hash_box.query(ImHash.cos_value.nearest_neighbor(target.cos_value, 8)).build()\r\n        results = query.find_with_scores()\r\n    results.sort(key=lambda x: x[1])\r\n    return results\r\n\r\n\r\ndef remove(key: str):\r\n    target = find_unique(key)\r\n    if target is not None:\r\n        with store_lock:\r\n            hash_box.remove(target)\r\n\r\n\r\ndef remove_many(keys: list[str]):\r\n    with store.write_tx():\r\n        for k in keys:\r\n            i = find_unique(k)\r\n            if i is None:\r\n                print('Hash key \"%s\" was already gone' % k)\r\n            else:\r\n                with store_lock:\r\n                    hash_box.remove(i.id)\r\n```","author":{"url":"https://github.com/patknight","@type":"Person","name":"patknight"},"datePublished":"2024-11-17T21:40:06.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/25/objectbox-python/issues/25"}
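
A rough sizing check for the figures quoted in the issue body above (62720 dimensions, about 20,000 objects, about 6GB on disk). This is only a sketch: it assumes each vector component is a 4-byte float32, and it ignores the HNSW index and any storage overhead, which the issue does not quantify. The constant names are illustrative and do not come from the issue.

```python
# Figures taken from the issue text.
DIMENSIONS = 62_720        # HnswIndex(dimensions=62720)
OBJECT_COUNT = 20_000      # "about 20,000 hashes"

# Assumption: a Float32Vector stores 4 bytes per component.
BYTES_PER_FLOAT32 = 4

bytes_per_vector = DIMENSIONS * BYTES_PER_FLOAT32
raw_vector_bytes = bytes_per_vector * OBJECT_COUNT

print(f"one vector:  {bytes_per_vector} bytes")           # one vector:  250880 bytes
print(f"all vectors: {raw_vector_bytes / 1e9:.2f} GB")    # all vectors: 5.02 GB
```

Under these assumptions the raw vectors alone account for roughly 5 GB, which is in line with the reported database size and means each stored object carries about a quarter of a megabyte of vector data.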

Links:

objectbox https://patch-diff.githubusercontent.com/objectbox
objectbox-python https://patch-diff.githubusercontent.com/objectbox/objectbox-python
Seeking advice: remove is painfully slow https://patch-diff.githubusercontent.com/objectbox/objectbox-python/issues/25#top
patknight https://github.com/patknight
on Nov 17, 2024 https://github.com/objectbox/objectbox-python/issues/25#issue-2666574973
