deanmalmgren / textract

extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.89k stars 599 forks source link

Error in textract setup command w/ extract-msg<=0.29.* due to Wheel 0.40.0 #461

Open seankfh opened 1 year ago

seankfh commented 1 year ago

Textract cannot be installed with the current version of Wheel (0.40.0) because https://github.com/pypa/wheel/issues/520 made it so that .* suffix can only be used with == or != operators. I receive the following error:

Collecting textract==1.6.4
  Downloading textract-1.6.4.tar.gz (17 kB)
  Preparing metadata (setup.py): started
  Preparing metadata (setup.py): finished with status 'error'
  error: subprocess-exited-with-error

  × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [3 lines of output]
      error in textract setup command: 'install_requires' must be a string or list of strings containing valid project/version requirement specifiers; Expected end or semicolon (after version specifier)
          extract-msg<=0.29.*
                     ~~~~~~^
      [end of output]

I see forks where people have tried to drop the Python 2 dependencies and work around this error. Can we get consensus to move forward with one of these? Has anyone in the community found a solution or alternative library? Locally I can pin to an earlier version of wheel but this is breaking our serverless build where we have less control over the infrastructure.

seankfh commented 1 year ago

@traverseda are you or @deanmalmgren available to consider removing the asterisk and comply with the latest wheel constraints? Please let me know how I can help. Thank you!

chenrui333 commented 1 year ago

also relates to https://github.com/Homebrew/homebrew-core/pull/132584 (we have some regression build failure on this)