techascent / tech.ml.dataset

A Clojure high performance data processing system
Eclipse Public License 1.0
660 stars 33 forks source link

group-by categories #50

Closed genmeblog closed 4 years ago

genmeblog commented 4 years ago

Is there any way to get the list of the keys from group-by operation without actual grouping? I can do group-by and then call keys but I'm not sure if it's efficient way.

cnuernber commented 4 years ago

The list of keys would be the set of unique items from either the column or subset of columns you are using for the operation. There is currently no efficient (less than O(row-count*included-column-count)) way.

cnuernber commented 4 years ago

Is this addressed by your new API?

genmeblog commented 4 years ago

Yes, it's done there.

cnuernber commented 4 years ago

Addressed by https://github.com/scicloj/tablecloth