Open diegodewilde opened 1 year ago
@diegodewilde I've thought about adding a "mode" (most common value) profiling metric to the package but never got around to implementing it. This proposal expands the mode concept into the N most common values, and I think it's a good idea.
Just throwing thoughts here:
top_1_value
, top_1_value_proportion
, top_2_value
, top_2_value_distribution
, etc?
Would you be interested in implementing this? :)
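To make the column idea above concrete, here is a rough Python sketch of what the metric would compute for a single column. Everything in it is an assumption for illustration only: the function name `top_n_values` is hypothetical, proportions are taken over non-null values, and the `_proportion` suffix is used for every rank (the list above mixes `_proportion` and `_distribution`). It is not how the package implements anything.

```python
from collections import Counter


def top_n_values(values, n=2):
    """Return the n most common non-null values and their proportions.

    Hypothetical helper: keys follow the top_<rank>_value /
    top_<rank>_value_proportion naming floated in this thread.
    """
    # Assumption: nulls are excluded from both the ranking and the denominator.
    non_null = [v for v in values if v is not None]
    total = len(non_null)
    result = {}
    # most_common(n) sorts by count, descending; ties keep first-seen order.
    for rank, (value, count) in enumerate(Counter(non_null).most_common(n), start=1):
        result[f"top_{rank}_value"] = value
        result[f"top_{rank}_value_proportion"] = count / total
    return result


profile = top_n_values(["a", "b", "a", "c", "a", "b", None], n=2)
```

Here `profile` would hold "a" as the top value with a proportion of 0.5 and "b" as the second with roughly 0.33. An empty or all-null column yields an empty dict, so a real implementation would have to decide what to emit in that case.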
Hi stumelius,
@diegodewilde Circling back to this. Are you still interested in this feature, and if so, would you like to contribute? :)
Hi,
I was looking at this project and I must say: it's awesome, and it's something that dbt docs is currently missing.
One thing that came to mind: why isn't there an option to add the top x column values and their distribution? Is there a reason not to include this in the docs?
For example, something like this, showing the top 2 values:
Looking forward to your thoughts!