apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
https://arrow.apache.org/
Apache License 2.0
14.56k stars 3.54k forks source link

[Python] Not able to import pyarrow.parquet #43126

Open Akhil-77 opened 4 months ago

Akhil-77 commented 4 months ago

Describe the usage question you have. Please include as many useful details as possible.

I am trying to load datasets library from Hugging Face but it gives me this error.

ImportError: The pyarrow installation is not built with support for the Parquet file format (DLL load failed while importing _parquet: The specified procedure could not be found.)

so I tried the following,

import pyarrow.parquet as pq

Which gave me the same error.

My pip is up to date and PyArrow is also up to date so why am I facing this issue. Can anyone help me with this??

Component(s)

Parquet

mapleFU commented 4 months ago

Out of curiousity what's the version of arrow you're using?

urucoder commented 2 months ago

is it the pip package, or a Conda package?

I faced this issue in a Conda environment, when installed pyarrow-core=17.0.0=py310h85d79f8_1_cpu from conda-forge, and I solved it by installing libarrow=17.0.0=h974b3c2_11_cpu