treeverse / lakeFS

lakeFS - Data version control for your data lake | Git for data
https://docs.lakefs.io
Apache License 2.0
4.46k stars 359 forks source link

Azure unity export - "table already exists" #8292

Open offirc2 opened 1 month ago

offirc2 commented 1 month ago

What happened?

For Unity Catalog Export functionality in Azure Databricks, create external table in Unity Catalog fails first time with "table already exists" error message but it succeeds in 2nd attempt.

C836CB0E-2D2C-4255-ABFB-64D46E6D587A (3) DEA69998-8B5F-4CFA-AF0A-71CF96E212DC (2)

It seems to basically the first table, the first time it tries the command, it says it failed because the table already exists. But then when we look at the command text, it runs it a second time and then it succeeds the second time around. The other thing is obviously, if we have a bunch of tables being updated at the same time and it generates a whole bunch of actions doing the same thing and some of them end up colliding and overlapping and they all fail obviously.

Bottom line - it’s functional, but its just generating a lot of errors and exceptions and failures, things like that get logged kind of mostly on the lakeFS side, i guess in the action logs, but also on the sql warehouse, we can see obviously all those failed commands.