attardi / wikiextractor

A tool for extracting plain text from Wikipedia dumps
GNU Affero General Public License v3.0

Template errors in article #314

Open etoilestar opened 1 year ago

etoilestar commented 1 year ago

Hello, I also ran into this problem when running python -m wikiextractor.WikiExtractor enwiki-latest-pages-articles.xml.bz2 --json. I get warnings such as "WARNING: Template errors in article 'Breakout (video game)' (453246): title(0) recursion(0, 0, 2)". I am running the script on Ubuntu with Python 3, and the wikiextractor package is up to date.

yulingao commented 1 year ago

Same problem here.

TommasoPetrolito commented 1 year ago

Someone reported a similar issue. The answer there was: "The warnings are normal, due to malformed templates." For more details, see https://github.com/attardi/wikiextractor/issues/33
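
If you want to see which articles trigger these warnings rather than just ignore them, a minimal sketch along these lines could pull the fields out of a captured log line. The pattern is inferred only from the single warning quoted above, so other wikiextractor versions may format the message differently; the function name parse_template_warning and the dictionary keys are hypothetical, and the meaning of the individual recursion counters is not stated in this thread, so they are kept as a plain list.

```python
import re

# Pattern inferred from the one warning quoted in this thread (an assumption,
# not a documented format).
WARNING_RE = re.compile(
    r"Template errors in article '(?P<title>.*)' \((?P<page_id>\d+)\): "
    r"title\((?P<title_count>\d+)\) recursion\((?P<recursion>[\d, ]+)\)"
)


def parse_template_warning(line):
    """Return the fields of a template warning line, or None if it does not match."""
    match = WARNING_RE.search(line)
    if match is None:
        return None
    return {
        "title": match.group("title"),
        "page_id": int(match.group("page_id")),
        "title_count": int(match.group("title_count")),
        # Counters are kept in the order they appear in the message.
        "recursion_counts": [int(n) for n in match.group("recursion").split(",")],
    }


sample = (
    "WARNING: Template errors in article 'Breakout (video game)' "
    "(453246): title(0) recursion(0, 0, 2)"
)
print(parse_template_warning(sample))
```

Running this over a saved log would let you count how many articles are affected; lines that are not template warnings return None and can be skipped.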