snowplow / dbt-snowplow-utils

Snowplow utility functions to be used in conjunction with the snowplow-web dbt package.

Add option to use distinct for get_string_agg macro #101

Closed agnessnowplow closed 1 year ago

agnessnowplow commented 1 year ago

Describe the feature

Currently the get_string_agg macro sorts the values and concatenates them into a single string, so any duplicates in the data appear in the output. Deduplication could be added as an option for users whose use case needs it: several warehouses support the distinct keyword inside the aggregate function, and on Databricks collect_set can be used instead of collect_list.
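To make the request concrete, here is a rough Python sketch of the SQL each warehouse could end up with when deduplication is switched on. It is only an illustration of the idea, not the macro's actual code: the function name `string_agg_sql`, the `is_distinct` and `target` parameters, and the exact SQL shapes are all assumptions, and the column is assumed to already be a string type.

```python
def string_agg_sql(column, separator=",", is_distinct=False, target="snowflake"):
    """Build a sorted string-aggregation expression for one warehouse.

    Hypothetical helper for illustration only; it does not mirror the
    real get_string_agg signature. Assumes `column` is already a string.
    """
    # Only emit the keyword when deduplication is requested.
    distinct = "distinct " if is_distinct else ""

    if target in ("snowflake", "redshift"):
        # listagg accepts an optional distinct before the expression.
        return (
            f"listagg({distinct}{column}, '{separator}') "
            f"within group (order by {column})"
        )
    if target in ("bigquery", "postgres"):
        # With distinct, the order by expression must match the aggregated one.
        return f"string_agg({distinct}{column}, '{separator}' order by {column})"
    if target == "databricks":
        # No distinct keyword needed: collect_set drops duplicates,
        # collect_list keeps them.
        collect = "collect_set" if is_distinct else "collect_list"
        return f"array_join(array_sort({collect}({column})), '{separator}')"
    raise ValueError(f"unsupported target: {target}")


if __name__ == "__main__":
    for warehouse in ("snowflake", "bigquery", "databricks"):
        print(string_agg_sql("page_id", is_distinct=True, target=warehouse))
```

Keeping the default as `is_distinct=False` would leave existing behaviour unchanged, so only users who opt in would see deduplicated output.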