apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
https://arrow.apache.org/
Apache License 2.0
14.17k stars 3.46k forks source link

[Python] 17.0.0 wrongly installs multiple stray directories into site-packages #43299

Closed mgorny closed 21 hours ago

mgorny commented 1 month ago

Describe the bug, including details regarding any error messages, version, and platform.

$ pip install pyarrow
Collecting pyarrow
  Downloading pyarrow-17.0.0-cp312-cp312-manylinux_2_28_x86_64.whl.metadata (3.3 kB)
Collecting numpy>=1.16.6 (from pyarrow)
  Using cached numpy-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB)
Downloading pyarrow-17.0.0-cp312-cp312-manylinux_2_28_x86_64.whl (39.9 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 39.9/39.9 MB 11.0 MB/s eta 0:00:00
Using cached numpy-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (19.0 MB)
Installing collected packages: numpy, pyarrow
Successfully installed numpy-2.0.0 pyarrow-17.0.0
$ ls .venv/lib/python3.12/site-packages/
benchmarks     examples  numpy-2.0.0.dist-info  pip                   pyarrow                   scripts
cmake_modules  numpy     numpy.libs             pip-24.1.2.dist-info  pyarrow-17.0.0.dist-info

Note that it wrongly installed benchmarks, cmake_modules, examples and scripts as top-level Python packages.

Component(s)

Python

raulcd commented 1 month ago

Hi,

Thanks for reporting. It seems we added this bug when migrating from setup.py to pyproject.toml

timkpaine commented 1 month ago

duplicate of https://github.com/apache/arrow/issues/43280

jorisvandenbossche commented 21 hours ago

Issue resolved by pull request 43325 https://github.com/apache/arrow/pull/43325

mgorny commented 21 hours ago

Thanks!