Title: Modularize ingestion distributed compute engine support · Issue #444 · feast-dev/feast · GitHub
Open Graph Title: Modularize ingestion distributed compute engine support · Issue #444 · feast-dev/feast
X Title: Modularize ingestion distributed compute engine support · Issue #444 · feast-dev/feast
Description: This is a companion to #402 and the larger topic of storage engine modularization which was realized in #529 and subsequent PRs that implemented the new interfaces. Just as adding support for new storage engines tends to cause a dependen...
Open Graph Description: This is a companion to #402 and the larger topic of storage engine modularization which was realized in #529 and subsequent PRs that implemented the new interfaces. Just as adding support for new s...
X Description: This is a companion to #402 and the larger topic of storage engine modularization which was realized in #529 and subsequent PRs that implemented the new interfaces. Just as adding support for new s...
Opengraph URL: https://github.com/feast-dev/feast/issues/444
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Modularize ingestion distributed compute engine support","articleBody":"This is a companion to #402 and the larger topic of storage engine modularization which was realized in #529 and subsequent PRs that implemented the new interfaces.\r\n\r\nJust as adding support for new storage engines tends to cause a dependency explosion for Feast `ingestion` \u0026 `serving`, the same is true for Beam Runner / job management adapter glue in `core` (this all could move to `serving` with future plans, but that won't change the fundamental problem this issue is about).\r\n\r\nSo for both storage and compute engines, I feel that some modularity strategy is needed for loose binding at build time, configurable for runtime. The goals would be to:\r\n\r\n- Minimize dependency pains that developers and contributors to Feast need to deal with if they are not actively working on a particular stack. The dependency trees are often large and fragile, especially in the Hadoop ecosystem, such as Hive and Spark.\r\n- Reduce deployment bloat if operators wish to package Feast internally with only the module JARs they need to support their organization's stack. (IIRC last I checked, `hadoop-common` or `hadoop-client` leave you with close to 200MB of jars, and `beam-runners-spark` and `beam-sdks-java-io-hcatalog` among others have these deps [as provided scope, but the point stands I believe]).\r\n\r\nPossibilities might be OSGi or `java.util.ServiceLoader` (and Spring integration or alternatives thereof). Open to other ideas!\r\n\r\nRelates to #362 ","author":{"url":"https://github.com/ches","@type":"Person","name":"ches"},"datePublished":"2020-01-27T05:28:04.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":4},"url":"https://github.com/444/feast/issues/444"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:68630842-9bc9-a1bc-8fe9-fac36a030850 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | 9A3A:3B9F7B:1D7809F:27ABB44:697AAE2F |
| html-safe-nonce | 8c6812183490dd68915339aa800e85ce38043826500dd90dc1e8c95f2b5ac143 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI5QTNBOjNCOUY3QjoxRDc4MDlGOjI3QUJCNDQ6Njk3QUFFMkYiLCJ2aXNpdG9yX2lkIjoiMjQ3MTkwMDAzMTYyODI1ODg2MyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 6260568e6ef7d2d820345ae7429378b63dd1df8c4de34e1895f83eb93316f401 |
| hovercard-subject-tag | issue:555364165 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/feast-dev/feast/444/issue_layout |
| twitter:image | https://opengraph.githubassets.com/2d591aa5d44ea535a151f9469c52f6b476a9653037c46c4e1229e6a4b750737d/feast-dev/feast/issues/444 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/2d591aa5d44ea535a151f9469c52f6b476a9653037c46c4e1229e6a4b750737d/feast-dev/feast/issues/444 |
| og:image:alt | This is a companion to #402 and the larger topic of storage engine modularization which was realized in #529 and subsequent PRs that implemented the new interfaces. Just as adding support for new s... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | ches |
| hostname | github.com |
| expected-hostname | github.com |
| None | 33acda8ebf71b04c9d33b494d2b6e0db48193ec89eb3ad92a9539d491fa9a3da |
| turbo-cache-control | no-preview |
| go-import | github.com/feast-dev/feast git https://github.com/feast-dev/feast.git |
| octolytics-dimension-user_id | 57027613 |
| octolytics-dimension-user_login | feast-dev |
| octolytics-dimension-repository_id | 161133770 |
| octolytics-dimension-repository_nwo | feast-dev/feast |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 161133770 |
| octolytics-dimension-repository_network_root_nwo | feast-dev/feast |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 4f714a84d3384e467839505ed65f968fc01aedf1 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width