visualfabriq / bquery

A query and aggregation framework for Bcolz (W2013-01)
https://www.visualfabriq.com
BSD 3-Clause "New" or "Revised" License
56 stars 11 forks source link

I sorted count distinct v2 #32

Closed FrancescElies closed 9 years ago

FrancescElies commented 9 years ago

Issue

ct = bq.open(ct_rootdir, mode='r')
barr = ct.where_terms( [('d30106', 'in', [4])] )
ct.groupby( [] , ['a30501'] , bool_arr=barr, agg_method='sorted_count_distinct')
CarstVaartjes commented 9 years ago

So the issue is when you do a sorted count distinct without a groupby column?

FrancescElies commented 9 years ago

exactly, the problem does not reside within the count distinct itself but with the way the output is formatted, I am making some additional tests at the moment, the problem still has some rough edges, I'll write back once is fully fixed

FrancescElies commented 9 years ago

After adding some tests to the suit and doing some manual checks too, I believe this issue is now to be solved

FrancescElies commented 9 years ago

Please let me know if I could merge this

CarstVaartjes commented 9 years ago

Go ahead, I might make it a bit more "smooth" later on but at least it stop-gaps the issue :)

FrancescElies commented 9 years ago

:+1: