vaexio / vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
https://vaex.io
MIT License
8.22k stars 589 forks source link

[BUG-REPORT] why the same dataframe and same groupby(vaex.agg.list) operation, one got error and one got correct if i export to hdf5 and loaded it back #2408

Open Yihao120 opened 6 months ago

Yihao120 commented 6 months ago

version 4.17.1

error

df.groupby(by=['a', 'b', 'c']).agg({'res1': vaex.agg.list('d')}) 图片

correct

df.export_hdf5('tmp.hdf5') df = vaex.open('tmp.hdf5') df.groupby(by=['a', 'b', 'c']).agg({'res1': vaex.agg.list('d')})