Title: Add `date_partition_column` to `SparkSource` · Issue #4835 · feast-dev/feast · GitHub
Open Graph Title: Add `date_partition_column` to `SparkSource` · Issue #4835 · feast-dev/feast
X Title: Add `date_partition_column` to `SparkSource` · Issue #4835 · feast-dev/feast
Description: Is your feature request related to a problem? Please describe. The current spark implementation scans over all parquet files. This process can be made faster and more efficient by specifying a date_partition_column. During execution, thi...
Open Graph Description: Is your feature request related to a problem? Please describe. The current spark implementation scans over all parquet files. This process can be made faster and more efficient by specifying a date...
X Description: Is your feature request related to a problem? Please describe. The current spark implementation scans over all parquet files. This process can be made faster and more efficient by specifying a date...
Opengraph URL: https://github.com/feast-dev/feast/issues/4835
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Add `date_partition_column` to `SparkSource`","articleBody":"**Is your feature request related to a problem? Please describe.**\r\nThe current spark implementation scans over all parquet files. This process can be made faster and more efficient by specifying a `date_partition_column`. During execution, this column would be used to filter the data at a file level. Only files who's date is within the range would be scanned.\r\n\r\n**Describe the solution you'd like**\r\nAdd `date_partition_column` to `SparkSource`. A similar implementation exists for the `AthenaSource`\r\n\r\n**Describe alternatives you've considered**\r\nNone\r\n\r\nI have implemented this locally and it works. I'm happy to open a PR\r\n","author":{"url":"https://github.com/niklasvm","@type":"Person","name":"niklasvm"},"datePublished":"2024-12-11T09:53:37.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/4835/feast/issues/4835"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:fa3b5316-6492-f06a-9a6a-c07930e9ac61 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | A5BE:1085F6:1319902:1995212:6970321A |
| html-safe-nonce | a0c953cec3bc1a16c4437cffd657b3f7943f7150655048a35c6451077e97e7e9 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBNUJFOjEwODVGNjoxMzE5OTAyOjE5OTUyMTI6Njk3MDMyMUEiLCJ2aXNpdG9yX2lkIjoiNjg4Nzg1OTM0MjUzMDQ0MjUwIiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0= |
| visitor-hmac | 6995d6d26527fdd02675f7150502feafd7253b7ad7696214413fe27aa252f47c |
| hovercard-subject-tag | issue:2732417062 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/feast-dev/feast/4835/issue_layout |
| twitter:image | https://opengraph.githubassets.com/a92976820616a90e684ebc903df8272aa1b23e78cbb901a5e294997afcf20ad6/feast-dev/feast/issues/4835 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/a92976820616a90e684ebc903df8272aa1b23e78cbb901a5e294997afcf20ad6/feast-dev/feast/issues/4835 |
| og:image:alt | Is your feature request related to a problem? Please describe. The current spark implementation scans over all parquet files. This process can be made faster and more efficient by specifying a date... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | niklasvm |
| hostname | github.com |
| expected-hostname | github.com |
| None | 9920a62ba22d06470388e2904804fb7e5ec51c9e35f81784e9191394c74b2bd2 |
| turbo-cache-control | no-preview |
| go-import | github.com/feast-dev/feast git https://github.com/feast-dev/feast.git |
| octolytics-dimension-user_id | 57027613 |
| octolytics-dimension-user_login | feast-dev |
| octolytics-dimension-repository_id | 161133770 |
| octolytics-dimension-repository_nwo | feast-dev/feast |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 161133770 |
| octolytics-dimension-repository_network_root_nwo | feast-dev/feast |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | f643964067a552f02067066d6a910b2f90a5721f |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width