bhaskatripathi / pdfGPT

PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
https://huggingface.co/spaces/bhaskartripathi/pdfGPT_Turbo
MIT License
6.98k stars 837 forks source link

pip install seems to take a long time #55

Open viveknirkhe opened 1 year ago

viveknirkhe commented 1 year ago

Did anyone see this issue? For me, it has been going on for a long time.

jacobyoby commented 1 year ago

Can you provide more info?

webyacusa commented 1 year ago

It is happening to me as well. This is what I did, running on a Mac running Ventura with the M1 chipset:

  1. checked out the latest code from the repository
  2. Downloaded the Universal Sentence Encoder, extracted the files to the location specified in the instructions
  3. Edited the line in api.py with the local reference (the self.use = hub.load('./Universal Sentence Encoder/') line)
  4. Executed from the CLI docker-compose -f docker-compose.yaml up

As a result, the building seems to be running, but it is taking a very long time. I am already at 5000 seconds in the build. A couple of observations: first, it does not seems like it is hanged, from time to time it does seem to be downloading and running things. Second, I got this message:

=> => # INFO: pip is looking at multiple versions of pydub to determine which version is compatible wit => => # h other requirements. This could take a while.
=> => # Downloading pydub-0.24.0-py2.py3-none-any.whl (30 kB)
=> => # INFO: This is taking longer than usual. You might need to provide the dependency resolver with => => # stricter constraints to reduce runtime. See https://pip.pypa.io/warnings/backtracking for guida => => # nce. If you want to abort this run, press Ctrl + C.

Am I missing something?

jacobyoby commented 1 year ago

I will pull the container and specify a specific version. JacobySent from the void. On May 9, 2023, at 07:57, webyacusa @.***> wrote: It is happening to me as well. This is what I did, running on a Mac running Ventura with the M1 chipset:

checked out the latest code from the repository Downloaded the Universal Sentence Encoder, extracted the files to the location specified in the instructions Edited the line in api.py with the local reference (the self.use = hub.load('./Universal Sentence Encoder/') line) Executed from the CLI docker-compose -f docker-compose.yaml up

As a result, the building seems to be running, but it is taking a very long time. I am already at 5000 seconds in the build. A couple of observations: first, it does not seems like it is hanged, from time to time it does seem to be downloading and running things. Second, I got this message:

=> => # INFO: pip is looking at multiple versions of pydub to determine which version is compatible wit => => # h other requirements. This could take a while. => => # Downloading pydub-0.24.0-py2.py3-none-any.whl (30 kB) => => # INFO: This is taking longer than usual. You might need to provide the dependency resolver with => => # stricter constraints to reduce runtime. See https://pip.pypa.io/warnings/backtracking for guida => => # nce. If you want to abort this run, press Ctrl + C.

Am I missing something?

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.***>

kwong-dev commented 1 year ago

Hello, Were you able to find what needed to be changed on certain package versions in the requirements.txt? I'm seeing the exact same problem when trying to use docker-compose -f docker-compose.yaml. I also see the same issue when trying to run docker build . as well.

I'm also using an M1 mac and when I try to run code a pip3 install -r requirements.txt in a virtual environment, it complains about being unable to find tensorflow versions:

Collecting PyMuPDF
  Downloading PyMuPDF-1.22.3-cp310-cp310-macosx_11_0_arm64.whl (12.7 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 12.7/12.7 MB 7.3 MB/s eta 0:00:00
Collecting numpy
  Using cached numpy-1.24.3-cp310-cp310-macosx_11_0_arm64.whl (13.9 MB)
Collecting scikit-learn
  Using cached scikit_learn-1.2.2-cp310-cp310-macosx_12_0_arm64.whl (8.5 MB)
ERROR: Could not find a version that satisfies the requirement tensorflow>=2.0.0 (from versions: 2.13.0rc0)
ERROR: No matching distribution found for tensorflow>=2.0.0

So this is likely a mac / tensorflow incompatility issue.

But when I try to run on a Windows11 machine, to run pip3 install -r requirements.txt on python 3.8.2 it will simply complain about being unable to find a version of langchain-serve that work with the rest of the dependencies.

ERROR: Cannot install -r requirements.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by:
    langchain-serve 0.0.32 depends on pandasai
    langchain-serve 0.0.31 depends on pandasai
    langchain-serve 0.0.30 depends on pandasai
    langchain-serve 0.0.29 depends on pandasai
    langchain-serve 0.0.28 depends on pandasai
    langchain-serve 0.0.27 depends on pandasai
    langchain-serve 0.0.26 depends on pandasai
    langchain-serve 0.0.25 depends on pandasai
    langchain-serve 0.0.24 depends on pandasai
    langchain-serve 0.0.23 depends on pandasai

I'm hoping to be able to run this locally or in a docker container. Let me know if I'm missing anything.

Thanks!

jacobyoby commented 1 year ago

It’s an OS issue. It builds in Windows. I haven’t found the specific issue yet. Sent from the void. On May 21, 2023, at 21:17, Kenneth Wong @.***> wrote: Hello, Were you able to find what needed to be changed on certain package versions in the requirements.txt? I'm seeing the exact same problem when trying to use docker-compose -f docker-compose.yaml. I also see the same issue when trying to run docker build . as well. I'm also using an M1 mac and when I try to run code a pip3 install -r requirements.txt in a virtual environment, it complains about being unable to find tensorflow versions: Collecting PyMuPDF Downloading PyMuPDF-1.22.3-cp310-cp310-macosx_11_0_arm64.whl (12.7 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 12.7/12.7 MB 7.3 MB/s eta 0:00:00 Collecting numpy Using cached numpy-1.24.3-cp310-cp310-macosx_11_0_arm64.whl (13.9 MB) Collecting scikit-learn Using cached scikit_learn-1.2.2-cp310-cp310-macosx_12_0_arm64.whl (8.5 MB) ERROR: Could not find a version that satisfies the requirement tensorflow>=2.0.0 (from versions: 2.13.0rc0) ERROR: No matching distribution found for tensorflow>=2.0.0

So this is likely a mac / tensorflow incompatility issue. But when I try to run on a Windows11 machine, to run pip3 install -r requirements.txt on python 3.8.2 it will simply complain about being unable to find a version of langchain-serve that work with the rest of the dependencies. ERROR: Cannot install -r requirements.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by: langchain-serve 0.0.32 depends on pandasai langchain-serve 0.0.31 depends on pandasai langchain-serve 0.0.30 depends on pandasai langchain-serve 0.0.29 depends on pandasai langchain-serve 0.0.28 depends on pandasai langchain-serve 0.0.27 depends on pandasai langchain-serve 0.0.26 depends on pandasai langchain-serve 0.0.25 depends on pandasai langchain-serve 0.0.24 depends on pandasai langchain-serve 0.0.23 depends on pandasai

I'm hoping to be able to run this locally or in a docker container. Let me know if I'm missing anything. Thanks!

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.***>

dnabanita7 commented 1 year ago

It works perfectly fine on linux based systems. IT gives error in mac based systems cuz of tensorflow package version onflict with grpcio. I will open an issue regarding the same.

bhaskoro-muthohar commented 1 year ago

Hello, Were you able to find what needed to be changed on certain package versions in the requirements.txt? I'm seeing the exact same problem when trying to use docker-compose -f docker-compose.yaml. I also see the same issue when trying to run docker build . as well.

I'm also using an M1 mac and when I try to run code a pip3 install -r requirements.txt in a virtual environment, it complains about being unable to find tensorflow versions:

Collecting PyMuPDF
  Downloading PyMuPDF-1.22.3-cp310-cp310-macosx_11_0_arm64.whl (12.7 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 12.7/12.7 MB 7.3 MB/s eta 0:00:00
Collecting numpy
  Using cached numpy-1.24.3-cp310-cp310-macosx_11_0_arm64.whl (13.9 MB)
Collecting scikit-learn
  Using cached scikit_learn-1.2.2-cp310-cp310-macosx_12_0_arm64.whl (8.5 MB)
ERROR: Could not find a version that satisfies the requirement tensorflow>=2.0.0 (from versions: 2.13.0rc0)
ERROR: No matching distribution found for tensorflow>=2.0.0

So this is likely a mac / tensorflow incompatility issue.

But when I try to run on a Windows11 machine, to run pip3 install -r requirements.txt on python 3.8.2 it will simply complain about being unable to find a version of langchain-serve that work with the rest of the dependencies.

ERROR: Cannot install -r requirements.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by:
    langchain-serve 0.0.32 depends on pandasai
    langchain-serve 0.0.31 depends on pandasai
    langchain-serve 0.0.30 depends on pandasai
    langchain-serve 0.0.29 depends on pandasai
    langchain-serve 0.0.28 depends on pandasai
    langchain-serve 0.0.27 depends on pandasai
    langchain-serve 0.0.26 depends on pandasai
    langchain-serve 0.0.25 depends on pandasai
    langchain-serve 0.0.24 depends on pandasai
    langchain-serve 0.0.23 depends on pandasai

I'm hoping to be able to run this locally or in a docker container. Let me know if I'm missing anything.

Thanks!

I think you can try this https://github.com/bhaskatripathi/pdfGPT/issues/10#issuecomment-1518678791