attardi / wikiextractor

A tool for extracting plain text from Wikipedia dumps
GNU Affero General Public License v3.0
3.74k stars 965 forks source link

Updating clean_markup function to be compatible with Extractor.__init… #318

Open miromannino opened 1 year ago

miromannino commented 1 year ago

clean_markup function in clean.py seems outdated.

It's trying to call Extractor constructor and the Extractor's method clean_text with a wrong number of parameters.