dbt-labs / dbt-external-tables

dbt macros to stage external sources
https://hub.getdbt.com/dbt-labs/dbt_external_tables/latest/
Apache License 2.0
314 stars 123 forks source link

Infer schema for BigQuery .parquet files #270

Closed LinasData closed 1 month ago

LinasData commented 7 months ago

Describe the feature

Currently it is mandatory to add data_type property when defining external tables columns. It would be nice for BigQuery to infer all columns as strings by default.

Describe alternatives you've considered

Tried this but it did not work. With CSV files works fine.

      # alternatively, BigQuery can infer your schema (columns + partitions)
      - name: event_inferred
        external:
          location: 'gs://bucket/path/*'
          options:
            format: parquet
            hive_partition_uri_prefix: 'gs://bucket/path/'

Who will this benefit?

Analytics engineers who work with Google Cloud services majority of the time.

github-actions[bot] commented 1 month ago

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please comment on the issue or else it will be closed in 7 days.

github-actions[bot] commented 1 month ago

Although we are closing this issue as stale, it's not gone forever. Issues can be reopened if there is renewed community interest. Just add a comment to notify the maintainers.

LinasData commented 1 month ago

Still interested. 🙏