idio / json-wikipedia

Json Wikipedia contains code to convert the Wikipedia XML dump into a JSON dump. Questions? https://gitter.im/idio-opensource/Lobby

Investigate moving to a Scala JSON parser for the wiki dump #48

Closed by stathischaritos 5 years ago

stathischaritos commented 7 years ago

We can use this as a starting point: https://github.com/mindfulmachines/wiki-parser

tgalery commented 7 years ago

Working with @stathischaritos on removing some deps from idio/json-wikipedia made me realise that that repo and Java are bullshit. It might be nice to start a new project that parses Wikimedia markup in Scala from scratch. To do the heavy lifting, we could use libs like