René's URL Explorer Experiment


Title: Bytewax materializer materializes entire dataset on every pod · Issue #3786 · feast-dev/feast · GitHub

Open Graph Title: Bytewax materializer materializes entire dataset on every pod · Issue #3786 · feast-dev/feast

X Title: Bytewax materializer materializes entire dataset on every pod · Issue #3786 · feast-dev/feast

Description: Expected Behavior Each pod should materialize a separate batch of data Current Behavior Every pod redundantly materializes the entire dataset, causing a massive overspend on processing time and database writes Steps to reproduce Observed...

Open Graph Description: Expected Behavior Each pod should materialize a separate batch of data Current Behavior Every pod redundantly materializes the entire dataset, causing a massive overspend on processing time and dat...

X Description: Expected Behavior Each pod should materialize a separate batch of data Current Behavior Every pod redundantly materializes the entire dataset, causing a massive overspend on processing time and dat...

Opengraph URL: https://github.com/feast-dev/feast/issues/3786

X: @github

direct link

Domain: github.com


Hey, it has json ld scripts:
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Bytewax materializer materializes entire dataset on every pod","articleBody":"## Expected Behavior \r\n\r\nEach pod should materialize a separate batch of data\r\n\r\n## Current Behavior\r\n\r\nEvery pod redundantly materializes the entire dataset, causing a massive overspend on processing time and database writes\r\n\r\n## Steps to reproduce\r\n\r\nObserved with snowflake offline store and dynamo db online store.  Materialize a feature with enough records for the snowflake s3 integration to output multiple files.  View the logs of each pod and add up the total records materialized.  Observe that every pod has materialized the entire dataset\r\n\r\n### Specifications\r\n\r\n- Version: 0.31\r\n- Platform: fedora linux\r\n- Subsystem: bytewax batch_engine\r\n\r\n## Possible Solution\r\n\r\nUse JOB_COMPLETION_INDEX in each pods dataflow to select a single file to process\r\n","author":{"url":"https://github.com/james-crabtree-sp","@type":"Person","name":"james-crabtree-sp"},"datePublished":"2023-10-09T19:18:29.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/3786/feast/issues/3786"}

route-pattern/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controllervoltron_issues_fragments
route-actionissue_layout
fetch-noncev2:21312db3-e0a2-8f3c-0549-afe31b344417
current-catalog-service-hash81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-idDD7E:30581F:E2AEE3:1347A40:69729581
html-safe-nonce448e67fb912b85fbbc27bb12da5a808efa1b712d3a883b34a5c9228c3f6009d3
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJERDdFOjMwNTgxRjpFMkFFRTM6MTM0N0E0MDo2OTcyOTU4MSIsInZpc2l0b3JfaWQiOiI1Mjg2ODg2MTM3ODk5Njg1MjQ5IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0=
visitor-hmacf88cc2c12ce6c2aecf3d7afe11319b9d36641385e2bb6408d6ef2f5bc096f38d
hovercard-subject-tagissue:1933654126
github-keyboard-shortcutsrepository,issues,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/issues_fragments/issue_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/feast-dev/feast/3786/issue_layout
twitter:imagehttps://opengraph.githubassets.com/13bd73029fcb6c67209cbf676fa21a8268d5398d35667b36e523999e76a20d63/feast-dev/feast/issues/3786
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/13bd73029fcb6c67209cbf676fa21a8268d5398d35667b36e523999e76a20d63/feast-dev/feast/issues/3786
og:image:altExpected Behavior Each pod should materialize a separate batch of data Current Behavior Every pod redundantly materializes the entire dataset, causing a massive overspend on processing time and dat...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernamejames-crabtree-sp
hostnamegithub.com
expected-hostnamegithub.com
None72bb1c46bb1ebdc0dc83a0a57b64c3b4d668c125d1125d94898213a4c9db8da2
turbo-cache-controlno-preview
go-importgithub.com/feast-dev/feast git https://github.com/feast-dev/feast.git
octolytics-dimension-user_id57027613
octolytics-dimension-user_loginfeast-dev
octolytics-dimension-repository_id161133770
octolytics-dimension-repository_nwofeast-dev/feast
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id161133770
octolytics-dimension-repository_network_root_nwofeast-dev/feast
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release7b2326416cb9f2fa4ab7b6ede33ad46d0dd431a1
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/feast-dev/feast/issues/3786#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Ffeast-dev%2Ffeast%2Fissues%2F3786
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Ffeast-dev%2Ffeast%2Fissues%2F3786
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=feast-dev%2Ffeast
Reloadhttps://github.com/feast-dev/feast/issues/3786
Reloadhttps://github.com/feast-dev/feast/issues/3786
Reloadhttps://github.com/feast-dev/feast/issues/3786
feast-dev https://github.com/feast-dev
feasthttps://github.com/feast-dev/feast
Notifications https://github.com/login?return_to=%2Ffeast-dev%2Ffeast
Fork 1.2k https://github.com/login?return_to=%2Ffeast-dev%2Ffeast
Star 6.6k https://github.com/login?return_to=%2Ffeast-dev%2Ffeast
Code https://github.com/feast-dev/feast
Issues 181 https://github.com/feast-dev/feast/issues
Pull requests 64 https://github.com/feast-dev/feast/pulls
Discussions https://github.com/feast-dev/feast/discussions
Actions https://github.com/feast-dev/feast/actions
Security 0 https://github.com/feast-dev/feast/security
Insights https://github.com/feast-dev/feast/pulse
Code https://github.com/feast-dev/feast
Issues https://github.com/feast-dev/feast/issues
Pull requests https://github.com/feast-dev/feast/pulls
Discussions https://github.com/feast-dev/feast/discussions
Actions https://github.com/feast-dev/feast/actions
Security https://github.com/feast-dev/feast/security
Insights https://github.com/feast-dev/feast/pulse
New issuehttps://github.com/login?return_to=https://github.com/feast-dev/feast/issues/3786
New issuehttps://github.com/login?return_to=https://github.com/feast-dev/feast/issues/3786
#3789https://github.com/feast-dev/feast/pull/3789
Bytewax materializer materializes entire dataset on every podhttps://github.com/feast-dev/feast/issues/3786#top
#3789https://github.com/feast-dev/feast/pull/3789
kind/bughttps://github.com/feast-dev/feast/issues?q=state%3Aopen%20label%3A%22kind%2Fbug%22
priority/p2https://github.com/feast-dev/feast/issues?q=state%3Aopen%20label%3A%22priority%2Fp2%22
https://github.com/james-crabtree-sp
https://github.com/james-crabtree-sp
james-crabtree-sphttps://github.com/james-crabtree-sp
on Oct 9, 2023https://github.com/feast-dev/feast/issues/3786#issue-1933654126
kind/bughttps://github.com/feast-dev/feast/issues?q=state%3Aopen%20label%3A%22kind%2Fbug%22
priority/p2https://github.com/feast-dev/feast/issues?q=state%3Aopen%20label%3A%22priority%2Fp2%22
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.