Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby
17
stars
2
forks
source link
List items being picked up as independent paragraphs #41
Open
keynmol opened 7 years ago
Example: https://simple.wikipedia.org/wiki/Human_evolution ("Species list" section)
In XML dump this looks like this:
Jsonpedia contains a very weird split with annotations being jammed together with wrong offsets: