CederGroupHub / LimeSoup

LimeSoup is a package to parse HTML or XML papers from different publishers.
MIT License

[ECS] Need to remove references from text #39

Closed OlgaGKononova closed 5 years ago

OlgaGKononova commented 5 years ago

I think I have already opened a similar issue before: in ECS papers, reference numbers are very often left at the start of a sentence. Example DOIs: 10.1149/1.1420706, 10.1149/1.1565141, 10.1149/1.3606475, 10.1149/2.003203jes. Please remove ALL reference numbers from the text.

I believe it should be solved by removing elements similar to this: <sup><a class="xref-bibr" href="#ref-25" id="xref-ref-25-1">25</a></sup>
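A minimal sketch of that idea, assuming the reference markers always appear as a `<sup>` that wraps only `<a class="xref-bibr">` links. The function name `strip_reference_markers` and the regex are hypothetical illustrations, not the actual LimeSoup ECS parser code, and a regex is used only to keep the example dependency-free:

```python
import re

# Hypothetical pattern: a <sup> whose only children are one or more
# <a class="xref-bibr"> links, optionally separated by commas,
# semicolons, hyphens, or whitespace.
_XREF_SUP = re.compile(
    r'<sup>\s*(?:<a\b[^>]*\bclass="xref-bibr"[^>]*>[^<]*</a>[\s,;-]*)+</sup>',
    flags=re.IGNORECASE,
)


def strip_reference_markers(html: str) -> str:
    """Remove superscript bibliography cross-references from an HTML fragment."""
    return _XREF_SUP.sub("", html)


sample = (
    '<p><sup><a class="xref-bibr" href="#ref-25" id="xref-ref-25-1">25</a></sup>'
    "The cell was cycled at 10 mA/cm<sup>2</sup>.</p>"
)
print(strip_reference_markers(sample))
```

Ordinary superscripts such as the exponent in `cm<sup>2</sup>` are left alone because they contain no `xref-bibr` link. Markup in real ECS papers may vary, so the pattern would need checking against the example DOIs above.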

hhaoyan commented 5 years ago

This is solved in the revised version of the ECS parser, but I have not run it yet. I also made several fixes to other parsers, including Elsevier, Nature, etc. The new paragraphs will likely arrive next week.