Title: Support transactions (and pandas.to_sql / read_sql_table) · Issue #72 · databricks/databricks-sql-python · GitHub
Open Graph Title: Support transactions (and pandas.to_sql / read_sql_table) · Issue #72 · databricks/databricks-sql-python
X Title: Support transactions (and pandas.to_sql / read_sql_table) · Issue #72 · databricks/databricks-sql-python
Description: It's a great project, and truly helps me in a variety of ways. However, I am having a few issues when utilizing it with Pandas. When reading a table I would like to do: def read( self, catalog_name: str, schema_name: str, table_name: str...
Open Graph Description: It's a great project, and truly helps me in a variety of ways. However, I am having a few issues when utilizing it with Pandas. When reading a table I would like to do: def read( self, catalog_name...
X Description: It's a great project, and truly helps me in a variety of ways. However, I am having a few issues when utilizing it with Pandas. When reading a table I would like to do: def read( self, catalog_...
Opengraph URL: https://github.com/databricks/databricks-sql-python/issues/72
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Support transactions (and pandas.to_sql / read_sql_table)","articleBody":"It's a great project, and truly helps me in a variety of ways. However, I am having a few issues when utilizing it with Pandas.\r\n\r\nWhen reading a table I would like to do:\r\n\r\n```python\r\n def read(\r\n self, catalog_name: str, schema_name: str, table_name: str\r\n ) -\u003e DataFrame:\r\n with self._get_connection(catalog_name) as connection:\r\n iterator = pd.read_sql_table(\r\n schema=schema_name,\r\n table_name=table_name,\r\n con=connection,\r\n )\r\n```\r\nThis should return a pandas dataframe.\r\n\r\nHowever, I am required to do the following lacking features from your otherwise great connection. \r\n```python\r\ndef read(\r\n self, catalog_name: str, schema_name: str, table_name: str\r\n ) -\u003e pd.DataFrame:\r\n with self._get_connection(catalog_name) as connection:\r\n query = f\"SELECT * FROM {catalog_name}.{schema_name}.{table_name}\"\r\n self.logger.debug(\"Running query '%s'\", query)\r\n df = pd.read_sql(query, connection)\r\n self.logger.debug(df)\r\n return df\r\n```\r\n\r\n \r\nSimilarly I cannot use the `to_sql` on a dataframe where i'd like to do something like the following:\r\n```python\r\n def write(\r\n self,\r\n dataframe: pd.DataFrame,\r\n catalog_name: str,\r\n schema_name: str,\r\n table_name: str,\r\n ) -\u003e pd.DataFrame:\r\n with self._get_connection(catalog_name) as connection:\r\n dataframe.to_sql(name=table_name, con=connection, schema=schema_name)\r\n``` \r\n\r\nBoth fail\r\n```\r\ndatabricks.sql.exc.NotSupportedError: Transactions are not supported on Databricks\r\n````\r\n\r\n## Versions:\r\n\r\n```\r\npython 3.10.6\r\ndatabricks-sql-connector==2.2.1\r\npandas==1.5.2\r\n\r\n```","author":{"url":"https://github.com/C0DK","@type":"Person","name":"C0DK"},"datePublished":"2022-12-14T12:36:37.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":5},"url":"https://github.com/72/databricks-sql-python/issues/72"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:6b93ccef-f3eb-2f0e-26de-8842b06c2c46 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | 88D0:167AC:52BEB9:6D2E37:697198C7 |
| html-safe-nonce | f14c7054bedeb5ebbfa668163116911132d441e1c67d743993744418e9d31f21 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4OEQwOjE2N0FDOjUyQkVCOTo2RDJFMzc6Njk3MTk4QzciLCJ2aXNpdG9yX2lkIjoiMjUzOTE1MDI0MzAwMTM3NDkxOSIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 3bee8ebe5c2fb612e24a0c04239b0a19a433c9053d0b0d49a950a7e03d1dd0e8 |
| hovercard-subject-tag | issue:1496546761 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/databricks/databricks-sql-python/72/issue_layout |
| twitter:image | https://avatars.githubusercontent.com/u/4998052?s=400&v=4 |
| twitter:card | summary |
| og:image | https://avatars.githubusercontent.com/u/4998052?s=400&v=4 |
| og:image:alt | It's a great project, and truly helps me in a variety of ways. However, I am having a few issues when utilizing it with Pandas. When reading a table I would like to do: def read( self, catalog_name... |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | C0DK |
| hostname | github.com |
| expected-hostname | github.com |
| None | fdfdce9cd4f6ab85dca2b0d11264270829297c962dd5a79df449062d7822258f |
| turbo-cache-control | no-preview |
| go-import | github.com/databricks/databricks-sql-python git https://github.com/databricks/databricks-sql-python.git |
| octolytics-dimension-user_id | 4998052 |
| octolytics-dimension-user_login | databricks |
| octolytics-dimension-repository_id | 493695132 |
| octolytics-dimension-repository_nwo | databricks/databricks-sql-python |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 493695132 |
| octolytics-dimension-repository_network_root_nwo | databricks/databricks-sql-python |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 51c736e60b302bd039c9d5164573d176ceb24bb2 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width