camelot-dev / excalibur

A web interface to extract tabular data from PDFs
https://excalibur-py.readthedocs.io
MIT License

Ghostscript Error, Stuck in Processing #117

Open arky opened 4 years ago

arky commented 4 years ago

Using the latest master, we cannot extract data on the latest Ubuntu 64-bit LTS. Excalibur gets stuck at the 'Processing' step. The installed Ghostscript version is 9.50~dfsg-5ubuntu4.1.

```
127.0.0.1 - - [06/Aug/2020 12:54:49] "GET /workspaces/eee2199b-6de1-4d36-8131-abc25fcca6cf HTTP/1.1" 200 -
GPL Ghostscript 9.50: Unrecoverable error, exit code 1
ERROR:root:-100
Traceback (most recent call last):
  File "Code/Camelot-dev/excalibur/.venv/lib/python3.8/site-packages/excalibur_py-0.4.3-py3.8.egg/excalibur/tasks.py", line 56, in split
    with Ghostscript(*gs_call, stdout=null) as gs:
  File "Code/Camelot-dev/excalibur/.venv/lib/python3.8/site-packages/camelot_py-0.8.2-py3.8.egg/camelot/ext/ghostscript/__init__.py", line 90, in Ghostscript
    return __Ghostscript(
  File "Code/Camelot-dev/excalibur/.venv/lib/python3.8/site-packages/camelot_py-0.8.2-py3.8.egg/camelot/ext/ghostscript/__init__.py", line 39, in __init__
    rc = gs.init_with_args(instance, args)
  File "Code/Camelot-dev/excalibur/.venv/lib/python3.8/site-packages/camelot_py-0.8.2-py3.8.egg/camelot/ext/ghostscript/_gsprint.py", line 174, in init_with_args
    raise GhostscriptError(rc)
127.0.0.1 - - [06/Aug/2020 12:54:50] "GET /workspaces/eee2199b-6de1-4d36-8131-abc25fcca6cf HTTP/1.1" 200 -
127.0.0.1 - - [06/Aug/2020 12:54:51] "GET /workspaces/eee2199b-6de1-4d36-8131-abc25fcca6cf HTTP/1.1" 200 -
127.0.0.1 - - [06/Aug/2020 12:54:51] "GET /workspaces/eee2199b-6de1-4d36-8131-abc25fcca6cf HTTP/1.1" 200 -
127.0.0.1 - - [06/Aug/2020 12:54:52] "GET /workspaces/eee2199b-6de1-4d36-8131-abc25fcca6cf HTTP/1.1" 200 -
```

![Screenshot from 2020-08-06 12-58-42](https://user-images.githubusercontent.com/808895/89496463-96c17e80-d7e4-11ea-89f3-91e8e8ad9035.png)

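
The `-100` in the traceback appears to be Ghostscript's generic fatal error code, so on its own it says little about the cause. One way to get a more specific message might be to run the `gs` command-line tool on the same PDF outside of Excalibur. The sketch below is only a diagnostic aid: the helper names `build_gs_args` and `render_with_cli` are hypothetical, the chosen switches (`png16m` device, 300 dpi) are assumptions rather than the arguments Excalibur actually passes, and it assumes a `gs` executable is on `PATH`.

```python
import shutil
import subprocess


def build_gs_args(pdf_path, png_path, resolution=300):
    """Build a Ghostscript command line that renders a PDF to PNG.

    The switches are standard Ghostscript options, but they are an
    assumption here, not the exact call made by Excalibur or Camelot.
    """
    return [
        "gs",
        "-dSAFER",            # restrict file system access
        "-dBATCH",            # exit after processing the input
        "-dNOPAUSE",          # do not prompt between pages
        "-sDEVICE=png16m",    # 24-bit colour PNG output
        f"-r{resolution}",    # rendering resolution in dpi
        f"-sOutputFile={png_path}",
        pdf_path,
    ]


def render_with_cli(pdf_path, png_path, resolution=300):
    """Run Ghostscript as a subprocess and return (exit_code, output text)."""
    if shutil.which("gs") is None:
        raise FileNotFoundError("Ghostscript executable 'gs' not found on PATH")
    proc = subprocess.run(
        build_gs_args(pdf_path, png_path, resolution),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr so the full message is captured
        text=True,
    )
    return proc.returncode, proc.stdout
```

Calling `render_with_cli("input.pdf", "page-%03d.png")` (placeholder file names) and printing the returned text should show whatever Ghostscript reports before it exits, which is usually more informative than the bare return code.
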
sidesteps commented 4 years ago

The web interface gets stuck in an infinite refresh loop when extracting any PDF.

aakashlpin commented 4 years ago

I was trying to Dockerize and host Excalibur and ran into this same issue. This makes the web interface non-deployable as well. Draft PR: https://github.com/camelot-dev/excalibur/pull/124

vinayak-mehta commented 4 years ago

Thanks for working on the PR! I'm getting closer to ghostscript in terms of todos and will try to look into this issue and the associated PR today.

michellebao commented 3 years ago

Still running into the infinite refresh loop error :(

michellebao commented 3 years ago

^^ Never mind, resolved by cloning what was on master instead of using the executable from the latest release.

TomProvence commented 3 years ago

> Thanks for working on the PR! I'm getting closer to ghostscript in terms of todos and will try to look into this issue and the associated PR today.

Hey brother, have you fixed this exception, `camelot.ext.ghostscript._gsprint.GhostscriptError: -100`? I ran into this problem when trying to process two or more files at the same time.
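
If the error really is tied to processing several files at once, one thing to try is making sure only one Ghostscript call runs at a time. This is a minimal sketch under that assumption: the thread does not confirm concurrency as the cause, `run_serialized` is a hypothetical helper, and `render` stands in for whatever function ends up invoking Ghostscript. A `threading.Lock` only serializes calls made from threads of the same process, so it would not help if the work is spread across separate worker processes.

```python
import threading

# One lock shared by every caller in this process.
_GS_LOCK = threading.Lock()


def run_serialized(render, *args, **kwargs):
    """Call render(*args, **kwargs) while holding a process-wide lock.

    `render` is a placeholder for the function that invokes Ghostscript;
    only one such call can be in progress at any moment.
    """
    with _GS_LOCK:
        return render(*args, **kwargs)
```

Each file would then be processed through `run_serialized(render, path)` instead of calling `render(path)` directly, trading throughput for the guarantee that calls never overlap.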

vinayak-mehta commented 3 years ago

I'll look into this in the next couple of months.

Oaklight commented 2 years ago

Any update?

TomProvence commented 1 year ago

Hello! Your email has been received, and I will reply as soon as possible. Thank you.

munish0838 commented 1 year ago

Any possible solution yet?

TomProvence commented 1 year ago

Hello! Your email has been received, and I will reply as soon as possible. Thank you.

arky commented 1 year ago

This project is no longer actively maintained.