Much of the logic for cleaning Wikipedia markup is already implemented in json-wikipedia, and its output is generally much easier to work with because annotations are stored separately from the article text.
We should add an option to use jsonpedia directly, without pre-processing the XML dump.
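
To illustrate why separately stored annotations are easier to consume, here is a minimal sketch of reading such a dump. It assumes one JSON object per line, and the field names (`title`, `text`, `annotations`, `start`, `end`, `target`) are hypothetical placeholders for illustration only; the actual json-wikipedia schema may differ and should be checked before relying on this.

```python
import json
from typing import Iterable, Iterator, List, NamedTuple


class Link(NamedTuple):
    surface: str  # anchor text as it appears in the article
    target: str   # title of the linked article


def read_articles(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one article dict per non-empty line (assumed line-delimited JSON)."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def extract_links(article: dict) -> List[Link]:
    """Resolve annotations against the plain text.

    Assumes hypothetical fields: each annotation carries character offsets
    `start` (inclusive) and `end` (exclusive) into `text`, plus a `target`.
    """
    text = article["text"]
    return [
        Link(text[a["start"]:a["end"]], a["target"])
        for a in article.get("annotations", [])
    ]


# Made-up sample record using the assumed layout above.
sample = (
    '{"title": "Pisa", "text": "Pisa is a city in Tuscany.", '
    '"annotations": [{"start": 18, "end": 25, "target": "Tuscany"}]}'
)

articles = list(read_articles([sample]))
links = extract_links(articles[0])
```

Because the text is already clean and the links are plain offsets, no markup parsing is needed at this stage, which is the main benefit of skipping the XML pre-processing step.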