Open contang0 opened 1 day ago
workaround in the meantime should be something like
def distinct(t, on):
aggs = {col: t[col].arbitrary() for col in t.columns if col not in on}
return t. group_by(on).agg(**aggs)
Thank you, will give it a try!
Is your feature request related to a problem?
At the moment Impala backend only supports .distinct() on a full table.
This works:
table.distinct()
This does not:
table.distinct(on=['col1', 'col2'])
What is the motivation behind your request?
This forces me to write verbose workarounds.
.distinct() on a subset of a table is pretty fundamental, in my view.
Describe the solution you'd like
The on clause in .distinct() should work.
What version of ibis are you running?
10.5
What backend(s) are you using, if any?
Impala
Code of Conduct