Describe the feature
Currently, the `get_string_agg` macro only sorts the data and aggregates it into a concatenated string, so any duplicates are kept. Deduplication could be added for users who need it in their specific use case, using the `distinct` keyword for multiple warehouses or `collect_set` instead of `collect_list` within Databricks.
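
To illustrate the idea, here is a rough sketch of what the deduplicated SQL could look like. It does not reflect how `get_string_agg` is currently implemented: the `orders` table and its columns are hypothetical, and the first query uses Snowflake's `listagg` only as one example of a warehouse that accepts `distinct` inside the aggregate.

```sql
-- Hypothetical input table: orders(customer_id, product_name)

-- Snowflake-style: DISTINCT inside LISTAGG drops duplicates before concatenating.
select
    customer_id,
    listagg(distinct product_name, ', ') within group (order by product_name) as products
from orders
group by customer_id;

-- Databricks: collect_set deduplicates, whereas collect_list keeps duplicates.
-- collect_set does not guarantee an order, so array_sort is applied before joining.
select
    customer_id,
    array_join(array_sort(collect_set(product_name)), ', ') as products
from orders
group by customer_id;
```

In the macro itself this could presumably be exposed as an optional flag that defaults to the current behavior, so existing models keep their output unless deduplication is explicitly requested.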