Open barak1412 opened 4 months ago
Description
As described in the title, HDFS support for the
scan_parquet
function will be welcomed.The aleternative,
scan_pyarrow_dataset
is not enough since it doesn't support streaming.Any
fsspec
fallback is an option?Thanks in advance.
+1
Might be possible with: https://github.com/Kimahriman/hdfs-native
@ion-elgreco indeed!
Is this something polars maintainers see as valuable addition?
@santosh-d3vpl3x I sure they are. @ion-elgreco How much effort would it take?
Apparently there is accepted
tag that indicates whether feature is accepted or not.
Now, I am not sure what does it take for this feature to get that tag before we sink in a lot of efforts just for it to not get accepted. Perhaps we should discuss the feasibility and the possible approaches to make ourselves confident.
Usually, @ritchie46 performs a triage as I have heard from the polars discord.
I understand. Do you familiar with Polars' object store code?
Description
As described in the title, HDFS support for the
scan_parquet
function will be welcomed.The alternative,
scan_pyarrow_dataset
is not enough since it doesn't support streaming.Any
fsspec
fallback is an option?Thanks in advance.