Open slvnwhrl opened 1 year ago
Hi, thank you for the report. This appears to be a bug indeed. I've opened a PR with a test, and I hope we can fix this soon.
In the meantime, a simple workaround is to pass the boolean expression directly to the selection argument:
import vaex

df = vaex.example()
# Pass the expression itself instead of selection=True
pandas_generator = df.to_pandas_df(selection=df.x > 5, chunk_size=20)
for i in pandas_generator:
    print(i)
Thanks!
Description
When converting a vaex.DataFrame to a pandas.DataFrame with selection=True and a positive chunk_size, the following error occurs:
RuntimeError: Oops, requesting column default from dataset, but it does not exist
Software information
Vaex version (import vaex; vaex.__version__): {'vaex-core': '4.16.0', 'vaex-viz': '0.5.4', 'vaex-hdf5': '0.14.1', 'vaex-server': '0.8.1', 'vaex-astro': '0.9.3', 'vaex-jupyter': '0.8.1', 'vaex-ml': '0.18.1'}
Additional information
To reproduce the error, I am providing the following example:
Producing this traceback:
Edit: After some more trying, I found that it works as expected if parallel is set to False. However, I don't think it should only work that way, as this is not mentioned in the documentation.