CederGroupHub / LimeSoup

LimeSoup is a package to parse HTML or XML papers from different publishers.
MIT License

Feedback on Wiley parser #22

Closed hhaoyan closed 5 years ago

hhaoyan commented 5 years ago

Here is a list of issues found for the Wiley parser:

Issue 2 is the most urgent one.

eddotman commented 5 years ago

We found that some Wiley papers don't have any section names at all. Is there a preferred behavior / output for when no section name exists?

zjensen262 commented 5 years ago

What is the desired output when a paper does not have a section structure? For example, see this paper: https://doi.org/10.1002/anie.201508702. These papers also don't use the standard tags, since there are no headers.

zjensen262 commented 5 years ago

@hhaoyan, could you address the above comments? Sorry, I forgot to tag anyone on this.

hhaoyan commented 5 years ago

Sorry for the late reply. For sections without a title, I think we can leave the section name empty.
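
A minimal sketch of what "leave it empty" could look like in practice. This is not LimeSoup's actual code or output schema: the function name `group_into_sections`, the `(tag, text)` input format, and the `name` / `content` keys are all assumptions made up for illustration. The idea is only that content appearing with no preceding header is collected into a section whose name is the empty string.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical set of tags treated as section headers (an assumption,
# not taken from the Wiley parser).
HEADING_TAGS = ("h1", "h2", "h3")


def group_into_sections(elements: List[Tuple[str, str]]) -> List[Dict]:
    """Group (tag, text) pairs into sections.

    Paragraphs that appear before any header, or in a paper with no
    headers at all, end up in a section whose name is "".
    """
    sections: List[Dict] = []
    current: Optional[Dict] = None
    for tag, text in elements:
        if tag in HEADING_TAGS:
            # A header starts a new named section.
            current = {"name": text.strip(), "content": []}
            sections.append(current)
        else:
            if current is None:
                # No header seen yet: open an untitled section.
                current = {"name": "", "content": []}
                sections.append(current)
            current["content"].append(text.strip())
    return sections


# A paper with no section structure yields one section with an empty name.
untitled = group_into_sections([("p", "First paragraph."), ("p", "Second paragraph.")])
print(untitled)
```

Using an empty string rather than dropping the key keeps the output shape the same for titled and untitled papers, so downstream code does not need a special case.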