allenai / papermage

library supporting NLP and CV research on scientific papers
https://papermage.org
Apache License 2.0

Integrate PDFPlumber Parser into papermage #2

Closed by bnewm0609 1 year ago

bnewm0609 commented 1 year ago

This PR integrates the pdfplumber parser into papermage by migrating it from mmda.

~There are a bunch of tests that were commented out in tests/test_parsers/test_pdf_plumber_parser.py that I still have to go through, but I wanted to start the PR with what I had so far.~