petermr / pyami

Semantic Reader of the Scientific Literature
Apache License 2.0
12 stars 9 forks source link

isolated bold full-stops in leading paragraphs (IPCC executive summary) lose first sentence #7

Open petermr opened 1 year ago

petermr commented 1 year ago

test_html.py def test_make_ipcc_obsidian_md(self):

first bold sentence is extracted on basis of continuous bold text. Some paras are followed by confidence limits and then isolated bold full stops. These are erroneously treated as the first sentence.