tuva-health / tuva_demo

A starter dbt project and synthetic claims dataset for trying out the Tuva Project.
https://thetuvaproject.com/
Apache License 2.0

Errors when running dbt deps #24

Closed Gonzalo933 closed 1 year ago

Gonzalo933 commented 1 year ago

I tried to follow the instructions here to set up the demo, but after running dbt deps I ran into some errors:

CaughtMacroErrorWithNodeError
Compilation Error in seed terminology__icd_10_cm (seeds/terminology/terminology__icd_10_cm.csv)
  macro 'dbt_macro__snowflake__load_seed' takes not more than 1 argument(s)

  > in macro load_seed (macros/load_seed.sql)
  > called by seed terminology__icd_10_cm (seeds/terminology/terminology__icd_10_cm.csv)
CaughtMacroErrorWithNodeError

It seems that there have been some changes in the Tuva Project macros and the post-hooks no longer work?

Changing the version in packages.yml to:

packages:
  - package: tuva-health/the_tuva_project
    version: [">=0.3.0","<=0.3.16"]

solves the problem. I'm using dbt Cloud v1.6.
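For readers unfamiliar with the two-element version list above, here is a minimal Python sketch of how such a pin restricts which releases are acceptable. It is only an illustration: the helper names `parse_version` and `satisfies` are hypothetical, it handles plain `major.minor.patch` strings only, and dbt's own resolver has its own semantic-versioning logic that this does not reproduce.

```python
import operator

# Two-character operators are listed first so ">=" is not mistaken for ">".
OPERATORS = [
    (">=", operator.ge),
    ("<=", operator.le),
    (">", operator.gt),
    ("<", operator.lt),
]


def parse_version(text):
    """Turn '0.3.16' into (0, 3, 16) so versions compare numerically."""
    return tuple(int(part) for part in text.strip().split("."))


def satisfies(version, constraints):
    """Return True only if `version` meets every constraint in the list."""
    candidate = parse_version(version)
    for constraint in constraints:
        for symbol, compare in OPERATORS:
            if constraint.startswith(symbol):
                bound = parse_version(constraint[len(symbol):])
                if not compare(candidate, bound):
                    return False
                break
        else:
            raise ValueError(f"unsupported constraint: {constraint!r}")
    return True


# The pin from the comment above.
PIN = [">=0.3.0", "<=0.3.16"]
print(satisfies("0.3.16", PIN))  # upper bound is inclusive
print(satisfies("0.3.17", PIN))  # a hypothetical later release is excluded
```

Under this reading, the pin keeps the package at or below 0.3.16, which matches the intent of avoiding the newer macros.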

Gonzalo933 commented 1 year ago

Forcing version 0.3.16 seems to create problems when running dbt build:

15:34:45 Database Error in seed terminology__provider (seeds/terminology/terminology__provider.csv)
  100080 (22000): Number of columns in file (17) does not match that of the corresponding table (15), use file format option error_on_column_count_mismatch=false to ignore this error
    File 'provider_data/provider.csv_0_1_1.csv.gz', line 2, character 1
    Row 1 starts at line 1, column "PROVIDER"[17]
    If you would like to continue loading when an error is encountered, use other values such as 'SKIP_FILE' or 'CONTINUE' for the ON_ERROR option. For more information on loading options, please run 'info loading_data' in a SQL client.
  compiled Code at target/run/the_tuva_project/seeds/terminology/terminology__provider.csv

sarah-tuva commented 1 year ago

Hi @Gonzalo933 thank you so much for reporting this bug. I'm looking into it now.

sarah-tuva commented 1 year ago

@Gonzalo933 I just pushed a bug fix. Let me know if you are still running into issues with this.

Gonzalo933 commented 1 year ago

This issue is fixed! Thank you.

I keep running into the other issue when loading one of the seeds. I can open a new issue for it:

13:17:04 Database Error in seed terminology__provider (seeds/terminology/terminology__provider.csv)
  100080 (22000): Number of columns in file (17) does not match that of the corresponding table (15), use file format option error_on_column_count_mismatch=false to ignore this error
    File 'provider_data/provider.csv_0_1_0.csv.gz', line 2, character 1
    Row 1 starts at line 1, column "PROVIDER"[17]
    If you would like to continue loading when an error is encountered, use other values such as 'SKIP_FILE' or 'CONTINUE' for the ON_ERROR option. For more information on loading options, please run 'info loading_data' in a SQL client.
  compiled Code at target/run/the_tuva_project/seeds/terminology/terminology__provider.csv

sarah-tuva commented 1 year ago

@Gonzalo933 we recently made some data model changes to the Provider seed. Make sure you are running with --full-refresh. That should solve the issue. Let us know if it doesn't!
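As a rough illustration of why a full refresh helps here, the following Python sketch reproduces the same kind of mismatch with an in-memory SQLite database. It is an assumption-laden analogy, not the Tuva or Snowflake implementation: the `load_rows` helper and the `col_N` column names are hypothetical, and the only behavior borrowed from dbt is that a full refresh of a seed drops and recreates the table instead of reusing the existing one.

```python
import sqlite3


def load_rows(conn, table, header, rows, full_refresh=False):
    """Load rows into `table`, reusing the existing table unless full_refresh is set."""
    if full_refresh:
        # Mirrors the idea behind --full-refresh: discard the old table definition.
        conn.execute(f"DROP TABLE IF EXISTS {table}")
    columns = ", ".join(f"{name} TEXT" for name in header)
    # Without a full refresh, an existing table keeps its old column list.
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({columns})")
    conn.execute(f"DELETE FROM {table}")
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)


conn = sqlite3.connect(":memory:")

# First load: the table is created with 15 columns.
old_header = [f"col_{i}" for i in range(15)]
load_rows(conn, "provider", old_header, [tuple("x" for _ in old_header)])

# The data model changes to 17 columns, but the 15-column table still exists.
new_header = [f"col_{i}" for i in range(17)]
new_rows = [tuple("y" for _ in new_header)]
try:
    load_rows(conn, "provider", new_header, new_rows)
except sqlite3.OperationalError as error:
    print("load failed:", error)

# Dropping and recreating the table lets the 17-column load succeed.
load_rows(conn, "provider", new_header, new_rows, full_refresh=True)
print(conn.execute("SELECT COUNT(*) FROM provider").fetchone()[0])
```

The middle load fails because the file has more columns than the table that was created earlier, which is the same shape of problem as the 17 versus 15 column error reported above.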