Title: Feat: New version of entity_key serDe · Issue #4283 · feast-dev/feast · GitHub
Open Graph Title: Feat: New version of entity_key serDe · Issue #4283 · feast-dev/feast
X Title: Feat: New version of entity_key serDe · Issue #4283 · feast-dev/feast
Description: Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] The current entity_key serDe (version 2) is below: def serialize_entity_key( enti...
Open Graph Description: Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] The current entity_key serDe (version 2) ...
X Description: Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] The current entity_key serDe (version...
Opengraph URL: https://github.com/feast-dev/feast/issues/4283
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Feat: New version of entity_key serDe","articleBody":"**Is your feature request related to a problem? Please describe.**\r\nA clear and concise description of what the problem is. Ex. I'm always frustrated when [...]\r\n\r\nThe current entity_key serDe (version 2) is below:\r\n```\r\ndef serialize_entity_key(\r\n entity_key: EntityKeyProto, entity_key_serialization_version=1\r\n) -\u003e bytes:\r\n \"\"\"\r\n Serialize entity key to a bytestring so it can be used as a lookup key in a hash table.\r\n\r\n We need this encoding to be stable; therefore we cannot just use protobuf serialization\r\n here since it does not guarantee that two proto messages containing the same data will\r\n serialize to the same byte string[1].\r\n\r\n [1] https://developers.google.com/protocol-buffers/docs/encoding\r\n \"\"\"\r\n sorted_keys, sorted_values = zip(\r\n *sorted(zip(entity_key.join_keys, entity_key.entity_values))\r\n )\r\n\r\n output: List[bytes] = []\r\n for k in sorted_keys:\r\n output.append(struct.pack(\"\u003cI\", ValueType.STRING))\r\n output.append(k.encode(\"utf8\"))\r\n for v in sorted_values:\r\n val_bytes, value_type = _serialize_val(\r\n v.WhichOneof(\"val\"),\r\n v,\r\n entity_key_serialization_version=entity_key_serialization_version,\r\n )\r\n\r\n output.append(struct.pack(\"\u003cI\", value_type))\r\n\r\n output.append(struct.pack(\"\u003cI\", len(val_bytes)))\r\n output.append(val_bytes)\r\n\r\n return b\"\".join(output)\r\n```\r\n\r\ne.g, for `sorted_keys = {tuple: 1} item_id` and `sorted_values = {tuple: 1} int64_val: 1\\n` will give output:\r\n`[b'\\x02\\x00\\x00\\x00', b'item_id', b'\\x04\\x00\\x00\\x00', b'\\x08\\x00\\x00\\x00', b'\\x01\\x00\\x00\\x00\\x00\\x00\\x00\\x00']`\r\n\r\nThis makes deserialization not doable. In order to deserialize we can append the \"length\" of value to the `join_key`, such as for the same test key and value we can get the output:\r\n`[b'\\x02\\x00\\x00\\x00', b'\\x07\\x00\\x00\\x00', b'item_id', b'\\x04\\x00\\x00\\x00', b'\\x08\\x00\\x00\\x00', b'\\x01\\x00\\x00\\x00\\x00\\x00\\x00\\x00']`\r\n\r\nThen we can deserialize the bytes to proto.\r\n\r\n\r\n\r\n**Describe the solution you'd like**\r\nA clear and concise description of what you want to happen.\r\n\r\n\r\n\r\n**Describe alternatives you've considered**\r\nA clear and concise description of any alternative solutions or features you've considered.\r\n\r\n**Additional context**\r\nAdd any other context or screenshots about the feature request here.\r\n","author":{"url":"https://github.com/HaoXuAI","@type":"Person","name":"HaoXuAI"},"datePublished":"2024-06-16T19:00:04.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/4283/feast/issues/4283"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:1589793b-f84b-0c2a-c2d4-3e191fed6e68 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | 89E8:277007:54A597:784856:696FAAAA |
| html-safe-nonce | 128da9eba6ba89b705402ee2b0d25f62fb5b929c0d7a8b292022fd03cc9e8c73 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4OUU4OjI3NzAwNzo1NEE1OTc6Nzg0ODU2OjY5NkZBQUFBIiwidmlzaXRvcl9pZCI6IjcwNjg2ODE5Mjk0MjIyNTI3MTQiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | aa540bb5104b78c60938e676c27b0cb5a24d62e1b8602bcb96b6b3749e46e083 |
| hovercard-subject-tag | issue:2355928562 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/feast-dev/feast/4283/issue_layout |
| twitter:image | https://opengraph.githubassets.com/97e1cd215c11cbecaf07728ab1aa99e1a5d17a890871225b9ce2dc46ebd060cd/feast-dev/feast/issues/4283 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/97e1cd215c11cbecaf07728ab1aa99e1a5d17a890871225b9ce2dc46ebd060cd/feast-dev/feast/issues/4283 |
| og:image:alt | Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] The current entity_key serDe (version 2) ... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | HaoXuAI |
| hostname | github.com |
| expected-hostname | github.com |
| None | 356c704aafcc9a6179b2bc62a546ee20a28226cdeddba29d8ae86c3750ef0f76 |
| turbo-cache-control | no-preview |
| go-import | github.com/feast-dev/feast git https://github.com/feast-dev/feast.git |
| octolytics-dimension-user_id | 57027613 |
| octolytics-dimension-user_login | feast-dev |
| octolytics-dimension-repository_id | 161133770 |
| octolytics-dimension-repository_nwo | feast-dev/feast |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 161133770 |
| octolytics-dimension-repository_network_root_nwo | feast-dev/feast |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | e19b0670387556fcdd8027326ad85eecb0b536dd |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width