lizeyan / Zotero-meta-update

Update existing items' metadata by searching in DBLP
MIT License
40 stars 2 forks

Weird HTML tags <scp> in some titles #14

Open albert-ying opened 1 year ago

albert-ying commented 1 year ago

Hi Zeyan,

Thank you for this amazing tool! I updated my whole library today and found that some of the paper titles contain HTML tags, which was unexpected.

[Screenshot: CleanShot 2023-11-13 at 02 05 42@2x] Link

[Screenshot: CleanShot 2023-11-13 at 02 10 53@2x] Link

And this is what it looks like in Word: [Screenshot: CleanShot 2023-11-13 at 02 11 52@2x]

lizeyan commented 1 year ago

I think it comes from the database (Crossref or DBLP). I haven't added any tags.

albert-ying commented 1 year ago

Yeah, I agree. It may come from web scraping. Maybe it makes sense to filter out this specific tag, since it may cause problems?
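
For illustration, here is a minimal sketch of what such a filter could look like. It assumes each title is available as a plain string before it is written back to the item. `strip_scp_tags` is a hypothetical helper, not part of Zotero-meta-update, and it is written in Python only to show the idea; the plugin itself would need the equivalent in its own language.

```python
import re

# Matches opening and closing <scp> tags, case-insensitively,
# including any attributes inside the opening tag.
_SCP_TAG = re.compile(r"</?scp\b[^>]*>", re.IGNORECASE)


def strip_scp_tags(title: str) -> str:
    """Remove <scp> and </scp> markers but keep the text they wrap.

    Hypothetical helper for illustration only.
    """
    return _SCP_TAG.sub("", title)


if __name__ == "__main__":
    print(strip_scp_tags("A survey of <scp>DNA</scp> storage"))
```

This removes only the `<scp>` markers and leaves other tags such as `<i>` or `<sub>` untouched, so formatting that Zotero can render is not lost.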