vaexio / vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
https://vaex.io
MIT License
8.27k stars 590 forks source link

[BUG-REPORT] Runtime error occurs when joining dataframes #2319

Open vignesh-bungee opened 1 year ago

vignesh-bungee commented 1 year ago

Hi Vaex Team,

Description When joining two dataframes, getting runtime error saying Oops, get an empty chunk, from 33112 to 33112, that should not happen. First dataframe df1 is filtered and aggregated. Second dataframe df2 is simply the source dataframe filtered. Size of both left and right dataframes is 33112. I am using c5.x large EC2 instance. Can you please confirm if it is a bug or I am missing any parameter to be passed ?

Software information

Vaex version 4.14.0 Vaex was installed via: pip OS: Linux Additional information Jupyter notebook and sample data attached

Additional information Error Log Attached image