janetzki / fact_extraction

Fact Extraction from Text
6 stars 0 forks source link

Extracted HTML from Wikipedia dump is not clean #76

Closed janetzki closed 7 years ago

janetzki commented 7 years ago

It still contains parts of Wikimarkup.