idio / json-wikipedia

Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby
17 stars 2 forks source link

Parsing Inline References #51

Closed stathischaritos closed 6 years ago

stathischaritos commented 6 years ago

Connects to #44 We parse the inline references and add them in a list inside ParagraphsWithLinks. We keep the inner text ,start and end position, to be used by wikistats for ignoring them.

tgalery commented 6 years ago

I think it would be good to squash all the commits you have into one after you address comments and maybe we can run this on the "simple" wikipedia

hmcc commented 6 years ago

Minor nitpicks, but looking good!