CederGroupHub / LimeSoup

LimeSoup is a package to parse HTML or XML papers from different publishers.
MIT License
19 stars 7 forks source link

which file format should be given #47

Open ahmedusm opened 3 years ago

ahmedusm commented 3 years ago

I am unable to understand the usage case for LimeSoup. I am not sure which format for the article should be given here: with open(article, 'r', encoding = 'utf-8') as f: html_str = f.read() Moreover, I guess this usage given is for a single article. what if there are thousands of articles to be parsed.