Title: Searcher with JaccardMeasure does not seem to work · Issue #9 · nullnull/simstring · GitHub
Open Graph Title: Searcher with JaccardMeasure does not seem to work · Issue #9 · nullnull/simstring
X Title: Searcher with JaccardMeasure does not seem to work · Issue #9 · nullnull/simstring
Description: Hi, I don't really know whether it's a bug or not. When replacing the CosineMeasure by JaccardMeasure in the MWE and using 1-chargrams, I got matches with scores below the threshold. from simstring.feature_extractor.character_ngram impor...
Open Graph Description: Hi, I don't really know whether it's a bug or not. When replacing the CosineMeasure by JaccardMeasure in the MWE and using 1-chargrams, I got matches with scores below the threshold. from simstring...
X Description: Hi, I don't really know whether it's a bug or not. When replacing the CosineMeasure by JaccardMeasure in the MWE and using 1-chargrams, I got matches with scores below the threshold. from s...
Opengraph URL: https://github.com/nullnull/simstring/issues/9
X: @github
Domain: patch-diff.githubusercontent.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Searcher with JaccardMeasure does not seem to work","articleBody":"Hi,\r\n\r\nI don't really know whether it's a bug or not. When replacing the ``CosineMeasure`` by ``JaccardMeasure`` in the MWE and using 1-chargrams, I got matches with scores below the threshold.\r\n\r\n```python\r\nfrom simstring.feature_extractor.character_ngram import CharacterNgramFeatureExtractor\r\nfrom simstring.measure.jaccard import JaccardMeasure\r\nfrom simstring.database.dict import DictDatabase\r\nfrom simstring.searcher import Searcher\r\n\r\ndb = DictDatabase(CharacterNgramFeatureExtractor(1))\r\ndb.add('fibrates')\r\n\r\nsearcher = Searcher(db, JaccardMeasure())\r\nresults = searcher.ranked_search('abattoirs', 0.8)\r\nprint(results)\r\n\r\n[[0.7, 'fibrates']]\r\n```","author":{"url":"https://github.com/jtourille","@type":"Person","name":"jtourille"},"datePublished":"2020-08-03T20:03:42.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/9/simstring/issues/9"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:18b32767-f7cd-fc64-ed7d-ccf3f2c34846 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | E1D8:374910:91DB70:C7C0ED:69811E54 |
| html-safe-nonce | c160810c31a5954af683f67a3fb39b49f59f379dedfbd0af5388e5aaeb4d10d9 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFMUQ4OjM3NDkxMDo5MURCNzA6QzdDMEVEOjY5ODExRTU0IiwidmlzaXRvcl9pZCI6IjgxMTAwOTAwNzEzODgwMDM5MjQiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | 73afe45dc41f0acd1892fa33b1459b5f832dc1acd0a46b8b9cfaa19e6e1f3940 |
| hovercard-subject-tag | issue:672311105 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/nullnull/simstring/9/issue_layout |
| twitter:image | https://opengraph.githubassets.com/573ffe39655bd10f45990123a9b08acf1f7e9ac0788b93d751d43a0285ad55aa/nullnull/simstring/issues/9 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/573ffe39655bd10f45990123a9b08acf1f7e9ac0788b93d751d43a0285ad55aa/nullnull/simstring/issues/9 |
| og:image:alt | Hi, I don't really know whether it's a bug or not. When replacing the CosineMeasure by JaccardMeasure in the MWE and using 1-chargrams, I got matches with scores below the threshold. from simstring... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | jtourille |
| hostname | github.com |
| expected-hostname | github.com |
| None | 39fe8101494cbb823c09b619b68c80cd4d05ab7279997038dbe06bb91608abe1 |
| turbo-cache-control | no-preview |
| go-import | github.com/nullnull/simstring git https://github.com/nullnull/simstring.git |
| octolytics-dimension-user_id | 1388551 |
| octolytics-dimension-user_login | nullnull |
| octolytics-dimension-repository_id | 7327946 |
| octolytics-dimension-repository_nwo | nullnull/simstring |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 7327946 |
| octolytics-dimension-repository_network_root_nwo | nullnull/simstring |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | d5b34a4e4898b066c629879feb4b184bc471d6a7 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width