bdewilde closed 1 year ago
I'll work on a PR for updating pdfestrian, and make sure it all works.
@samanz not a big deal because i've not added branch protections yet, but in future PRs, please don't hesitate to approve/request changes rather than just comment. (unless only commenting is your intention, of course.) hopefully that will help speed up review cycles, especially since our dev hours are bound to be on different schedules for this project. sound good?
changes
celery: from 3.1.25 to ~4.0
dedupe: from >=1.4.14 to ~1.4.14
itsdangerous: from >=0.24 to >=2.0.0,<2.1.0
psycopg2: from >=2.7.0 to ~=2.9.0
redis: from ==2.10.5 to ~=3.2.0
spacy: from >=1.2.0,<2.0.0 to ~=2.0
textacy: from >=0.3.2,<0.5.0 to ==0.5.0
webargs: from >=1.5.1 to ~=1.5
updated textacy usage in code to agree with version bump
updated .gitignore to include standard python packaging cruft
context
afaict, on the production machine, we are currently running Python 3.6.12 and PostgreSQL 9.6.11 with the following dependencies (filtered to just direct, not transitive, deps)
i wasn't able to reproduce this environment locally, owing to changes in supported versions of things (e.g. PY3.6 no longer supported properly via homebrew installs), so i had to incrementally bump / constrain things until everything got working.
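for anyone unsure what the `~=` ("compatible release") pins in the changes list actually permit, here is a minimal sketch of the rule in plain Python. it only handles simple dotted numeric versions (no pre-releases, epochs, or local tags), `is_compatible` is a hypothetical helper and not something in this repo, and the "installed" versions in the usage lines are made up for illustration. for real checks, a proper version-parsing library is the safer choice.

```python
from typing import Tuple


def parse(version: str) -> Tuple[int, ...]:
    """Split a simple dotted version like '2.9.0' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def is_compatible(installed: str, pinned: str) -> bool:
    """Rough check of `installed ~= pinned`.

    `~=X.Y.Z` means `>=X.Y.Z` and `==X.Y.*`; `~=X.Y` means `>=X.Y` and `==X.*`.
    """
    inst, pin = parse(installed), parse(pinned)
    if len(pin) < 2:
        raise ValueError("~= needs at least two version components")

    # Pad with zeros so that '2.9' and '2.9.0' compare as equal.
    width = max(len(inst), len(pin))
    inst_padded = inst + (0,) * (width - len(inst))
    pin_padded = pin + (0,) * (width - len(pin))

    # Everything except the last pinned component must match exactly.
    prefix = pin[:-1]
    return inst_padded >= pin_padded and inst_padded[: len(prefix)] == prefix


# pins taken from the changes list; installed versions are hypothetical
print(is_compatible("2.9.5", "2.9.0"))   # psycopg2 ~=2.9.0
print(is_compatible("2.10.0", "2.9.0"))  # outside the 2.9.* series
print(is_compatible("2.3.1", "2.0"))     # spacy ~=2.0
```

so a three-component pin like `~=2.9.0` only floats on the patch version, while a two-component pin like `~=2.0` floats on the minor version too.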
questions
[2023-04-17 21:26:44,208] ERROR in errors: an unhandled exception occurred: Command '['/Users/burtondewilde/Desktop/projects/datakind__colandr/permanent-colandr-back/pdfestrian/bin/extractText.sh', '--filename', '/Users/burtondewilde/Desktop/projects/datakind__colandr/permanent-colandr-back/colandr_data/fulltexts/1/44.pdf']' returned non-zero exit status 1.
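the message above has the shape of Python's `subprocess.CalledProcessError`, which reports the exit status but not what the script wrote to stderr. a minimal sketch of how the failing command could be re-run with its output captured is below. `run_and_report` is a hypothetical helper, not how colandr actually invokes `extractText.sh`, and the demo command is a stand-in that fails on purpose.

```python
import subprocess
import sys
from typing import List


def run_and_report(cmd: List[str]) -> subprocess.CompletedProcess:
    """Run a command, capturing stdout/stderr instead of raising on failure."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    if result.returncode != 0:
        # Surface the script's own error output, which the log line omits.
        print(f"command failed with exit status {result.returncode}: {cmd}", file=sys.stderr)
        print(f"stderr:\n{result.stderr}", file=sys.stderr)
    return result


# stand-in for the real command: a child process that writes to stderr and exits 1
demo = run_and_report(
    [sys.executable, "-c", "import sys; sys.stderr.write('boom'); sys.exit(1)"]
)
```

swapping the stand-in for the `extractText.sh` command from the log should show whatever the script itself printed before exiting, which is probably where the real cause is.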