cat-lemonade / PDFDataExtractor

A toolkit for automatically extracting semantic information from PDF files of scientific articles
https://pdfdataextractor.readthedocs.io/en/latest/?
MIT License
64 stars 11 forks source link

running pyton code from demo notebook errors #7

Open tgiannulli opened 1 year ago

tgiannulli commented 1 year ago

ModuleNotFoundError Traceback (most recent call last) /workspaces/PDFDataExtractor2/demo/PDE Demo.ipynb Cell 10 line 1 ----> 1 from pdfdataextractor import Reader

File ~/.local/lib/python3.10/site-packages/pdfdataextractor-1.0-py3.10.egg/pdfdataextractor/init.py:2 1 # # -- coding: utf-8 -- ----> 2 from .extraction import *

File ~/.local/lib/python3.10/site-packages/pdfdataextractor-1.0-py3.10.egg/pdfdataextractor/extraction.py:13 11 from pdfminer.pdfpage import PDFPage 12 from collections import Counter, OrderedDict ---> 13 from templates import * 16 class Reader: 17 """Reader that reads in PDF files"""

File ~/.local/lib/python3.10/site-packages/pdfdataextractor-1.0-py3.10.egg/templates/init.py:1 ----> 1 from .elsevier import ElsevierTemplate 2 from .royal_society_of_chemistry import RoyalSocietyChemistryTemplate 3 from .american_chemical_society import AmericanChemicalSocietyTemplate

File ~/.local/lib/python3.10/site-packages/pdfdataextractor-1.0-py3.10.egg/templates/elsevier.py:5 3 from pdfminer.pdfparser import PDFParser 4 from pdfminer.pdfdocument import PDFDocument ----> 5 from chemdataextractor.doc import Paragraph 6 import re 9 class ElsevierTemplate(Methods):

Installing chemdataextractor or chemdataextractor2 fails to install using codespaces virtual environment python 3.10 Building wheels for collected packages: DAWG Building wheel for DAWG (setup.py) ... error error: subprocess-exited-with-error

Ong-Yi-Kai commented 9 months ago

I faced the same issue. Changing the python version to 3.8 worked for me