vaexio / vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
https://vaex.io
MIT License

[BUG-REPORT] min aggregation returns inf when sorting and delay Flags are set to True #2235

Open vignesh-bungee opened 1 year ago

vignesh-bungee commented 1 year ago

Hi Vaex Team, I am trying to find the minimum value of a column, comp_min_price, grouped by 5 columns. When the sort and delay flags are both set to True, I get inf as the minimum value. When sort is set to False, I get the correct minimum value. This looks like a bug. Can you please confirm?

Software information

Vaex version: 4.9.0
Vaex was installed via: pip
OS: Linux

Additional information

Input Data Description, Input Data and Issue Snapshot are attached.

Input Data Description: image

Input Data and Issue screenshot: image

JovanVeljanoski commented 1 year ago

Hmm.. interesting. Thank you very much for the report - we will try our best to figure this out.

Could you provide a minimal reproducible example (fake or random data is fine), something we can copy, paste and run on our end? That would be a great help.

Thanks!
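A reproducible example of the kind requested above might look roughly like the sketch below. The reporter's actual code appears only in screenshots, so everything here is an assumption: the group column names `g1` to `g5`, the random data, and the exact call shape (`df.groupby(by=..., agg=..., sort=..., delay=...)` followed by `df.execute()` and `.get()` on the delayed result). The pure-Python `reference_min` gives a ground truth to compare the Vaex result against.

```python
import random
from collections import defaultdict

# Hypothetical column names; the report only says grouping uses 5 columns.
GROUP_COLS = ["g1", "g2", "g3", "g4", "g5"]


def make_fake_rows(n=1000, seed=0):
    """Return a dict of column name -> list with random group keys and prices."""
    rng = random.Random(seed)
    data = {name: [rng.randint(0, 2) for _ in range(n)] for name in GROUP_COLS}
    data["comp_min_price"] = [round(rng.uniform(1.0, 100.0), 2) for _ in range(n)]
    return data


def reference_min(data):
    """Pure-Python groupby-min, used as ground truth."""
    result = defaultdict(lambda: float("inf"))
    keys = zip(*(data[name] for name in GROUP_COLS))
    for key, price in zip(keys, data["comp_min_price"]):
        result[key] = min(result[key], price)
    return dict(result)


def vaex_min(data, sort, delay):
    """Run the same aggregation in Vaex (needs `pip install vaex`).

    The imports are local so the helpers above work without Vaex installed.
    The call shape is an assumption about what the report did.
    """
    import numpy as np
    import vaex

    df = vaex.from_arrays(**{k: np.asarray(v) for k, v in data.items()})
    out = df.groupby(
        by=GROUP_COLS,
        agg={"min_price": vaex.agg.min("comp_min_price")},
        sort=sort,
        delay=delay,
    )
    if delay:
        df.execute()     # run the scheduled tasks
        out = out.get()  # resolve the delayed result into a DataFrame
    keys = zip(*(out[name].tolist() for name in GROUP_COLS))
    return dict(zip(keys, out["min_price"].tolist()))
```

Comparing `vaex_min(data, sort=True, delay=True)` and `vaex_min(data, sort=False, delay=True)` against `reference_min(data)` on the same `data = make_fake_rows()` would show whether the inf values depend on the sort flag, as the report describes. Whether this particular data triggers the problem is not something this thread confirms.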

vignesh-bungee commented 1 year ago

image