Closed: dioptre closed this issue 4 months ago
Hi @dioptre! Thanks for raising this issue! Could you share a code snippet that causes this error?
Based off the line number it looks like it may be an in-memory scan?
I run:
daft.read_parquet(
    [parquets in s3 array], use_native_downloader=True
).to_arrow()
Could you also run:
df = daft.read_parquet(
    [parquets in s3 array],
    use_native_downloader=True
)
df.explain()
We get a segmentation fault, so that won't be possible.
Please note that we are getting successes and then failures on the same files!
Describe the bug: Physical plan breaking: thread '' panicked at 'no entry found for key', src/daft-plan/src/physical_plan.rs:386:28
To Reproduce: Select multiple Parquet files from S3 and call daft.read_parquet with the native downloader.
Expected behavior: Working download
Desktop (please complete the following information):
Additional context: version 0.2.12
I'm blocked from using daft due to the unreliability - please help!
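Since the report says the same files sometimes succeed and sometimes fail, a small repeat-read harness might help quantify how often the failure occurs. The following is only a minimal sketch, not part of the original report: `repeat_read` and `read_with_daft` are hypothetical helper names, and the only daft calls used are the ones quoted in this thread (`daft.read_parquet(..., use_native_downloader=True).to_arrow()`). It assumes the Rust panic surfaces in Python as an exception, possibly one derived from `BaseException`, so it catches broadly. A segmentation fault would kill the process and cannot be tallied this way.

```python
from collections import Counter
from typing import Callable, Sequence


def read_with_daft(paths: Sequence[str]) -> object:
    """Read the files the same way as in the report (requires daft installed)."""
    import daft  # imported lazily so the harness has no hard dependency on daft

    return daft.read_parquet(list(paths), use_native_downloader=True).to_arrow()


def repeat_read(
    paths: Sequence[str],
    attempts: int = 10,
    reader: Callable[[Sequence[str]], object] = read_with_daft,
) -> Counter:
    """Call `reader(paths)` `attempts` times and tally the outcomes.

    Successful reads are counted under "ok"; failures are counted under
    the exception class name.
    """
    outcomes: Counter = Counter()
    for _ in range(attempts):
        try:
            reader(paths)
            outcomes["ok"] += 1
        except (KeyboardInterrupt, SystemExit):
            # Let the user stop the run; do not record these as failures.
            raise
        except BaseException as exc:
            # Assumption: a panic from the native layer may not derive from
            # Exception, so BaseException is caught here on purpose.
            outcomes[type(exc).__name__] += 1
    return outcomes
```

Calling `repeat_read(paths)` with the real list of S3 paths would return a tally of successes and failures by exception type, which could be attached to the issue alongside the daft version.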