Title: Feast push (Redshift/DynamoDb) not work with PushMode.ONLINE_AND_OFFLINE when more than 500 columns · Issue #3282 · feast-dev/feast · GitHub
Open Graph Title: Feast push (Redshift/DynamoDb) not work with PushMode.ONLINE_AND_OFFLINE when more than 500 columns · Issue #3282 · feast-dev/feast
X Title: Feast push (Redshift/DynamoDb) not work with PushMode.ONLINE_AND_OFFLINE when more than 500 columns · Issue #3282 · feast-dev/feast
Description: Expected Behavior Currently, we have a push source with Redshift Offline Store and DynamoDb Online Store. We built our view with more than 500 columns. Around 750 columns. We expected to ingest data in dynamo and redshift when we run fs....
Open Graph Description: Expected Behavior Currently, we have a push source with Redshift Offline Store and DynamoDb Online Store. We built our view with more than 500 columns. Around 750 columns. We expected to ingest dat...
X Description: Expected Behavior Currently, we have a push source with Redshift Offline Store and DynamoDb Online Store. We built our view with more than 500 columns. Around 750 columns. We expected to ingest dat...
Opengraph URL: https://github.com/feast-dev/feast/issues/3282
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Feast push (Redshift/DynamoDb) not work with PushMode.ONLINE_AND_OFFLINE when more than 500 columns","articleBody":"## Expected Behavior \r\nCurrently, we have a push source with Redshift Offline Store and DynamoDb Online Store. \r\nWe built our view with more than 500 columns. Around 750 columns. \r\n\r\nWe expected to ingest data in dynamo and redshift when we run \r\n`fs.push(\"push_source\", df, to=PushMode.ONLINE_AND_OFFLINE)`\r\n\r\n## Current Behavior\r\n\r\nPush command raise an issue like` [ERROR] ValueError: The input dataframe has columns ..`\r\nThis issue come from `get_table_column_names_and_types `method in `write_to_offline_store` method.\r\nIn the method, we check if `if set(input_columns) != set(source_columns)` and raise the below issue if there are diff. \r\n\r\nIn case with more than 500 columns we get a diff because source_columns come from `get_table_column_names_and_types` method result where the result is define by MaxResults parameters. \r\n\r\n## Steps to reproduce\r\n```\r\nentity= Entity(\r\n name=\"entity\",\r\n join_keys=[\"entity_id\"],\r\n value_type=ValueType.INT64,\r\n)\r\n\r\npush_source = PushSource(\r\n name=\"push_source\",\r\n batch_source=RedshiftSource(\r\n table=\"fs_push_view\",\r\n timestamp_field=\"datecreation\",\r\n created_timestamp_column=\"created_at\"),\r\n)\r\n\r\nbesoin_embedding_push_view = FeatureView(\r\n name=\"push_view\",\r\n entities=[entity],\r\n schema=[Field(name=f\"field_{dim}\", dtype=types.Float64) for dim in range(768)],\r\n source=push_source \r\n)\r\n\r\nfs.push(\"push_source\", df, to=PushMode.ONLINE_AND_OFFLINE)\r\n\r\n```\r\n\r\n### Specifications\r\n\r\n- Version: 0.25.0\r\n- Platform: AWS\r\n- Subsystem:\r\n\r\n## Possible Solution\r\n\r\nIn my mind, we have two solutions:\r\n\r\n- Set higher MaxResults in describe_table method\r\n- Use NextToken to iterate through results\r\n\r\n","author":{"url":"https://github.com/beubeu13220","@type":"Person","name":"beubeu13220"},"datePublished":"2022-10-10T14:06:04.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":2},"url":"https://github.com/3282/feast/issues/3282"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:9bb0347d-74ac-bfb6-dcc5-49ee759b8e1c |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | E41A:2FB70B:AA92B4:F165B9:6978C46C |
| html-safe-nonce | 7b5a7cdc4429e10d9e5f8afb6c1fe27a6e68035717ea423b3bbb69d8981c1494 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFNDFBOjJGQjcwQjpBQTkyQjQ6RjE2NUI5OjY5NzhDNDZDIiwidmlzaXRvcl9pZCI6IjE1NTg5ODA4ODI5MDk5Mzg3OTYiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | d6d3a37628799962684a32f4ada5f9b578f365ab0f92bc3a43939b1d7a927207 |
| hovercard-subject-tag | issue:1403237884 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/feast-dev/feast/3282/issue_layout |
| twitter:image | https://opengraph.githubassets.com/3b1317dc1826e919cfd5fffc7df4371aba3cb2644c9a6d814847baaac06db75c/feast-dev/feast/issues/3282 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/3b1317dc1826e919cfd5fffc7df4371aba3cb2644c9a6d814847baaac06db75c/feast-dev/feast/issues/3282 |
| og:image:alt | Expected Behavior Currently, we have a push source with Redshift Offline Store and DynamoDb Online Store. We built our view with more than 500 columns. Around 750 columns. We expected to ingest dat... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | beubeu13220 |
| hostname | github.com |
| expected-hostname | github.com |
| None | 8a71ca1f7ab5436216e1df86f398ef7a51ed3152c90e3f7332fc70cf97fcb9d6 |
| turbo-cache-control | no-preview |
| go-import | github.com/feast-dev/feast git https://github.com/feast-dev/feast.git |
| octolytics-dimension-user_id | 57027613 |
| octolytics-dimension-user_login | feast-dev |
| octolytics-dimension-repository_id | 161133770 |
| octolytics-dimension-repository_nwo | feast-dev/feast |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 161133770 |
| octolytics-dimension-repository_network_root_nwo | feast-dev/feast |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | eaed570eead52a9920afef75dc3eb6d826fdc5ef |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width