Open dnk8n opened 3 years ago
I have come across the same issue. It seems that any words linked to a page that does not exist are skipped automatically. Is there any way to solve this and keep the words?
I'm having the same problem. I lost most of the page's content; only the first word or two appear, and the rest is gone. Is there a way to fix it?
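One possible workaround is to rewrite the markup before extraction so the visible words survive. The following is only a minimal sketch, not WikiExtractor's own code: `keep_link_text` is a hypothetical helper, and it assumes the dropped words come from plain wiki links (`[[target|label]]`) or interlanguage-link templates (`{{ill|Title|lang}}`), which is unconfirmed for the article in this report.

```python
import re

# [[target]] or [[target|label]]: keep the label if present, else the target.
WIKILINK = re.compile(r"\[\[(?:[^\[\]|]*\|)?([^\[\]|]*)\]\]")

# {{ill|Title|lang...}}: keep only the first positional argument (the title).
# Assumption: this template is what marks "page does not exist" links.
ILL = re.compile(r"\{\{\s*ill\s*\|\s*([^|{}]+?)\s*(?:\|[^{}]*)?\}\}", re.IGNORECASE)


def keep_link_text(wikitext: str) -> str:
    """Replace link markup with its visible text instead of dropping it."""
    text = ILL.sub(r"\1", wikitext)
    return WIKILINK.sub(r"\1", text)


if __name__ == "__main__":
    sample = "its founding president was {{ill|Luigi Vittorio Bertarelli|it}}."
    print(keep_link_text(sample))
```

Running a pass like this over the raw wikitext before it reaches the extractor would keep the name in the sentence, at the cost of losing the link target itself.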
Note in the example below, in paragraph 2, sentence 1, how "its founding president was Luigi Vittorio Bertarelli." was not captured correctly. Instead, it was truncated to "its founding president was ."
Original article: https://en.wikipedia.org/wiki?curid=3917542&oldid=1034257382
Wikiextract text output: