databrickslabs / dlt-meta

This is metadata driven DLT based framework for bronze/silver pipelines
Other
127 stars 55 forks source link

Unity dataflowspec tables #3

Closed Vosec closed 6 months ago

Vosec commented 1 year ago

Hello,

if the dataflowspecs tables + bronze/silver tables are stored in schema managed by UC, what is the reason to use/set paths for these tables? I'm unable to create dataflowspec tables if I want them to be stored in schema managed by UC. If I store it in hive_metastore, it runs fine.

The main issue is that users don't know the paths for the tables managed by UC, they should only reference the tables names in specific schema. Or am I missing something?

"silver_dataflowspec_table": "silver_dataflowspec_table", "silver_dataflowspec_path": "dbfs:/onboarding_tables_cdc/silver", "bronze_dataflowspec_table": "bronze_dataflowspec_table", "bronze_dataflowspec_path": "dbfs:/onboarding_tables_cdc/bronze",

Thanks.

Vosec commented 1 year ago

Im pointing to this source code for example: original_dataflow_df = self.spark.read.format("delta").load(dict_obj["silver_dataflowspec_path"]) If the tables in UC are stored in external location, then the users need READ FILES permission to this external location.

Could this be replaced with just spark.sql("select ...") to that table? So there wont be any permission issues and paths configuration?

ravi-databricks commented 1 year ago

@Vosec dlt-meta does not support UC, we will be making new release for UC support.

Vosec commented 1 year ago

@ravi-databricks Thank you very much for the response. Do you have rough estimate for the release?

doxie-king commented 11 months ago

What's the timeline on this getting added with support for UC?

ravi-databricks commented 10 months ago

AS of now there is working branch for [UC support] (https://github.com/databrickslabs/dlt-meta/tree/feature/dlt-meta-uc). we will plan to release UC branch in coming quarter.

ravi-databricks commented 6 months ago

Just update on UC feature: There is PR#28 which will be merged soon to main.

ravi-databricks commented 6 months ago

UC Support is released to main branch