Closed cmheidt closed 6 years ago
@VanChriz Will look into this issue. cc/ @reckart
@VanChriz Good news: I've reproduced the reported stacktrace in a minimal test setup with a reduced markup text version of https://de.wikipedia.org/wiki/Humanbiologie which contains a table structure in it.
Next, I will now work on a bugfix for parsing such structures correctly.
@mawiesne Thanks! I'm sure rz will be appreciative too.
Hi,
I am trying to add the plain texts to Wikipedia Page objects using the
Page.getPlainText
method. However fetching these plain texts results in aVisitNotFoundException
. The Exception occurs, for example, with the German Wikipedia Articles "Reaktive Sauerstoffspezies" or "Humanbiologie" but does not occur, for example, with "Insulin im Gehirn". Fetching the markup text instead works just fine. I included the stacktrace below, could you look into this bug?