DLT Dataset (Lending Tree)?

databricks-demos / dbdemos

Demos to implement your Databricks Lakehouse

Other

273 stars 87 forks source link

DLT Dataset (Lending Tree)? #13

Closed Fallinggator closed 1 year ago

Fallinggator commented 1 year ago

I tried to use the full DLT demo, but ran into a few problems.

Had to manually alter the cluster it tried to create to use a cluster policy since required by my org. (Resolved, but would be nice if aspects like that were easily configurable with args)
Failed to load the lending tree dataset csv. I couldn't find it anywhere in the workspace samples or in the repo. I looked on Kaggle, but there are too many potential matches. No idea which of the lending tree data sets it is expecting.

karthik25-gk commented 1 year ago

Having the same issue with the dataset csv. Can I know where I can download from.

QuentinAmbard commented 1 year ago

ok I'll have a look, must be a bundle issue

QuentinAmbard commented 1 year ago

@Fallinggator I'm not sure I'm looking at the right place, are you saying you don't have this dataset in your workspace ?

spark.read.csv('/databricks-datasets/lending-club-loan-stats', header=True) ?

Can you confirm which demo you're loading and where it's failing exactly ? Thanks

karthik25-gk commented 1 year ago

@QuentinAmbard Ignore my issue. It’s been resolved. Thanks

QuentinAmbard commented 1 year ago

Ok I understand what happened, becuase you can't create the cluster, the init job did not start, and the data wasn't generated. I'll instead start the init job with the current cluster, it'll be faster & solve these kind of issue.

QuentinAmbard commented 1 year ago

Hi @Fallinggator , we released a new option that you can enable in the intallation: use_current_cluster = True

You should now have a better error and a suggestion for you to use this option if we can't install a new cluster:

This should solve your 2 issues !

Longer term it'll likely become the default option

Feel free to reopen or let us know if you're still facing some issue while using this new option!