data-mie / dbt-profiler

Macros for generating dbt model data profiles
Apache License 2.0

Docblock is not output when executing `print_profile_docs` with BigQuery #76

Open toohsk opened 1 year ago

toohsk commented 1 year ago

Hi there,

I'm trying to include profile results in my docs using the `print_profile_docs` macro, but the output differs depending on the target.

When I target Postgres, I get the following result:

$ POSTGRES_HOST=localhost dbt run-operation print_profile_docs --args '{"relation_name": "test_data_default"}' -t postgres
07:26:38  Running with dbt=1.5.1
07:26:38  Found 8 models, 25 tests, 0 snapshots, 0 analyses, 806 macros, 0 operations, 3 seed files, 0 sources, 0 exposures, 0 metrics, 0 groups
{% docs dbt_profiler__test_data_default  %}
| column_name          | data_type | row_count | not_null_proportion | distinct_proportion | distinct_count | is_unique | min        | max        |                avg | median |     std_dev_population |         std_dev_sample | ... |
| -------------------- | --------- | --------- | ------------------- | ------------------- | -------------- | --------- | ---------- | ---------- | ------------------ | ------ | ---------------------- | ---------------------- | --- |
| id                   | integer   |         5 |                 1.0 |                 1.0 |              5 |      True | 1          | 5          | 3.0000000000000000 |      3 | 1.41421356237309500000 | 1.58113883008418970000 | ... |
| numeric_not_nullable | integer   |         5 |                 1.0 |                 0.6 |              3 |     False | 1          | 3          | 1.8000000000000000 |      2 | 0.74833147735478827712 | 0.83666002653407554798 | ... |
| numeric_nullable     | integer   |         5 |                 0.6 |                 0.4 |              2 |     False | 1          | 3          | 1.6666666666666667 |      1 | 0.94280904158206336587 | 1.15470053837925150000 | ... |
| string_not_nullable  | text      |         5 |                 1.0 |                 0.6 |              3 |     False |            |            |                    |        |                        |                        | ... |
| string_nullable      | text      |         5 |                 0.6 |                 0.4 |              2 |     False |            |            |                    |        |                        |                        | ... |
| date_nullable        | date      |         5 |                 0.6 |                 0.2 |              1 |     False | 2022-01-01 | 2022-01-01 |                    |        |                        |                        | ... |
| bool_nullable        | boolean   |         5 |                 0.8 |                 0.4 |              2 |     False |            |            | 0.6000000000000000 |        |                        |                        | ... |
{% enddocs %}

But when I target BigQuery, I get the following result:

$ dbt run-operation print_profile_docs --args '{"relation_name": "customers"}' -t dev
07:29:00  Running with dbt=1.5.0
07:29:01  Found 3 models, 9 tests, 0 snapshots, 0 analyses, 407 macros, 0 operations, 3 seed files, 0 sources, 0 exposures, 0 metrics, 0 groups
| column_name            | data_type | row_count | not_null_proportion | distinct_proportion | distinct_count | is_unique | min        | max        |                 avg |  std_dev_population |      std_dev_sample | profiled_at                   |
| ---------------------- | --------- | --------- | ------------------- | ------------------- | -------------- | --------- | ---------- | ---------- | ------------------- | ------------------- | ------------------- | ----------------------------- |
| customer_id            | int64     |       100 |                1.00 |                1.00 |            100 |      True | 1          | 100        | 50.5000000000000100 | 28.8660700477221250 | 29.0114919758820240 | 2023-06-21 07:29:06.504541+00 |
| first_name             | string    |       100 |                1.00 |                0.79 |             79 |     False |            |            |                     |                     |                     | 2023-06-21 07:29:06.504541+00 |
| last_name              | string    |       100 |                1.00 |                0.19 |             19 |     False |            |            |                     |                     |                     | 2023-06-21 07:29:06.504541+00 |
| first_order_date       | date      |       100 |                0.62 |                0.46 |             46 |     False | 2018-01-01 | 2018-04-07 |                     |                     |                     | 2023-06-21 07:29:06.504541+00 |
| most_recent_order_date | date      |       100 |                0.62 |                0.52 |             52 |     False | 2018-01-09 | 2018-04-09 |                     |                     |                     | 2023-06-21 07:29:06.504541+00 |
| number_of_orders       | int64     |       100 |                1.00 |                0.05 |              5 |     False | 0          | 5          |  0.9900000000000002 |  0.9848350115628506 |  0.9897964284119083 | 2023-06-21 07:29:06.504541+00 |

The biggest difference is that the docblock wrapper (`{% docs ... %}` and `{% enddocs %}`) isn't output on BigQuery. Is there anything I need to do? Thanks.
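
To make the difference concrete, here is a minimal Python sketch that checks captured output for the docblock tags and, as a stopgap, wraps a bare table in them. It is not part of dbt or dbt-profiler: the helper names `has_docblock` and `wrap_in_docblock` are hypothetical, and the `dbt_profiler__<relation_name>` naming is only inferred from the Postgres output above. It does not explain or fix why the macro omits the tags on BigQuery.

```python
import re

# Match `{% docs <name> %}` and `{% enddocs %}`; whitespace inside the tags may vary.
DOCS_OPEN = re.compile(r"\{%-?\s*docs\s+(\w+)\s*-?%\}")
DOCS_CLOSE = re.compile(r"\{%-?\s*enddocs\s*-?%\}")


def has_docblock(output: str) -> bool:
    """Return True if the text contains both an opening and a closing docs tag."""
    return bool(DOCS_OPEN.search(output)) and bool(DOCS_CLOSE.search(output))


def wrap_in_docblock(table: str, relation_name: str) -> str:
    """Wrap a Markdown table in a docs block named dbt_profiler__<relation_name>.

    The name pattern is an assumption taken from the Postgres output in this issue.
    """
    docs_name = f"dbt_profiler__{relation_name}"
    # Doubled braces in an f-string produce literal `{` and `}`.
    return f"{{% docs {docs_name} %}}\n{table.strip()}\n{{% enddocs %}}"


if __name__ == "__main__":
    # Shortened stand-in for the BigQuery output shown above.
    table = (
        "| column_name | data_type |\n"
        "| ----------- | --------- |\n"
        "| customer_id | int64     |\n"
    )
    if not has_docblock(table):
        print(wrap_in_docblock(table, "customers"))
```

Running the script on the shortened table prints it between `{% docs dbt_profiler__customers %}` and `{% enddocs %}`, which matches the shape of the Postgres result.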