angelosalatino / dblp-parser

A simple Python script for parsing the DBLP dataset
MIT License

XMLSyntaxError #2

ShunyangLi closed this issue 1 year ago

ShunyangLi commented 1 year ago

I got the following error:

  File "dblp.xml", line 284
    <author>Francesco Trov&ograve;</author>
                                 ^
XMLSyntaxError: Entity 'ograve' not defined, line 284, column 31

It seems to be an error caused by decoding.

angelosalatino commented 1 year ago

Hi

This is the first time I have come across this. Have you included the DTD?

I have just run the code on my side with the DTD included, and it works fine, because the definition of ograve is included in the DTD.
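To illustrate the point about the DTD, here is a minimal sketch. It is not the repository's code. It assumes lxml is installed, that the XML file's DOCTYPE refers to dblp.dtd by a relative name, and that the DTD sits in the same directory as the XML file. The helper name read_authors is hypothetical.

```python
from lxml import etree


def read_authors(xml_path):
    """Return the text of every <author> element in the file at xml_path.

    Hypothetical helper for illustration; not part of dblp-parser.
    """
    authors = []
    # load_dtd=True asks lxml to read the DTD named in the DOCTYPE, so that
    # named entities such as &ograve; are defined before they are used.
    for _, elem in etree.iterparse(
        xml_path, events=("end",), tag="author", load_dtd=True
    ):
        authors.append(elem.text)
        elem.clear()  # drop the element's content to keep memory use low
    return authors
```

If the DTD file is missing, the entity stays undefined, which matches the error reported above.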

ShunyangLi commented 1 year ago

Hi,

I solved this issue by including the DTD. I saw that the code had a download function, so I assumed the DTD would be downloaded automatically. It wasn't, so I downloaded it myself and it worked.

Thanks so much, the code is really helpful.

Best, Shunyang
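For anyone who hits the same problem, a pre-flight check along these lines may help. It is only a sketch and does not reproduce the repository's download function. The name ensure_dtd is hypothetical, and the DTD URL is an assumption about where DBLP publishes the file.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Assumed location of the DTD; verify it before relying on it.
DTD_URL = "https://dblp.org/xml/dblp.dtd"


def ensure_dtd(xml_path, dtd_name="dblp.dtd", url=DTD_URL):
    """Make sure the DTD sits next to the XML file, fetching it if absent.

    Hypothetical helper for illustration; not part of dblp-parser.
    """
    dtd_path = Path(xml_path).resolve().parent / dtd_name
    if not dtd_path.exists():
        # Download only when the file is missing, so an existing DTD is kept.
        urlretrieve(url, str(dtd_path))
    return dtd_path
```

Calling ensure_dtd("dblp.xml") before parsing makes the missing-file case explicit instead of surfacing later as an undefined-entity error.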