airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
https://airbyte.com
Other
16.2k stars 4.14k forks source link

Evaluate Databricks Destination and scope path to beta #21815

Closed grishick closed 1 year ago

grishick commented 1 year ago

We need to answer the following questions about our databricks destination connector:

When filing issues that come out of this research, please link them to this epic

grishick commented 1 year ago

timeboxing to 13 points

suhomud commented 1 year ago

Opened issues

  1. Current implementation supports S3 and Azure external storages. Should we extend it before moving to Beta? The full list of supported sources can be found here
grishick commented 1 year ago

Still TBD: manually test the connector

grishick commented 1 year ago
  1. Databricks connector use in-memory buffer. Should we migrate to File-based buffer?

Yes

Currently we are supporting External storage only so should we implement Managed storage in a scope of moving to Beta?

Yes

Current implementation supports S3 and Azure external storages. Should we extend it before moving to Beta? The full list of supported sources can be found here

No. S3 and Azure are OK for Beta.