CatchTheTornado / pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
https://demo.doctractor.com
GNU General Public License v3.0
1.38k stars 92 forks source link

docker部署,在fastapi中正常 #21

Closed PoleGeogry closed 2 weeks ago

PoleGeogry commented 3 weeks ago

2024-11-05 16:57:34 warnings.warn( 2024-11-05 16:57:34 2024-11-05 16:57:37 [2024-11-05 08:57:37,534: WARNING/ForkPoolWorker-11] Detecting bboxes: 0%| | 0/2 [00:00<?, ?it/s] 2024-11-05 16:58:17 [2024-11-05 08:58:17,382: ERROR/MainProcess] Process 'ForkPoolWorker-11' pid:48 exited with 'signal 9 (SIGKILL)' 2024-11-05 16:58:17 /usr/local/lib/python3.10/multiprocessing/resource_tracker.py:224: UserWarning: resource_tracker: There appear to be 1 leaked semaphore objects to clean up at shutdown 2024-11-05 16:58:17 warnings.warn('resource_tracker: There appear to be %d ' 2024-11-05 16:58:17 [2024-11-05 08:58:17,399: ERROR/MainProcess] Task handler raised error: WorkerLostError('Worker exited prematurely: signal 9 (SIGKILL) Job: 1.') 2024-11-05 16:58:17 Traceback (most recent call last): 2024-11-05 16:58:17 File "/usr/local/lib/python3.10/site-packages/billiard/pool.py", line 1265, in mark_as_worker_lost 2024-11-05 16:58:17 raise WorkerLostError( 2024-11-05 16:58:17 billiard.einfo.ExceptionWithTraceback: 2024-11-05 16:58:17 """ 2024-11-05 16:58:17 Traceback (most recent call last): 2024-11-05 16:58:17 File "/usr/local/lib/python3.10/site-packages/billiard/pool.py", line 1265, in mark_as_worker_lost 2024-11-05 16:58:17 raise WorkerLostError( 2024-11-05 16:58:17 billiard.exceptions.WorkerLostError: Worker exited prematurely: signal 9 (SIGKILL) Job: 1.