Open jsta opened 7 years ago
IF ONLY we knew of someone who was gonna be at the unconference and came from a wiki background and thought this was hella cool! :D
(Seriously, this seems like a fun project. +1 from me. I kinda want to work on a markup parser at some point too)
Wikimarkup parsing in R mwparser
repo: https://github.com/Ironholds/mwparser
I have had some interest from rOpenSci'ers in a package I wrote to access lake metadata from Wikipedia (https://github.com/jsta/wikilake). Unfortunately, it is a bit messy because I don't think I'm parsing the xml very efficiently (see https://stackoverflow.com/questions/33446888/r-convert-xml-data-to-data-frame). I could see getting a group together at the unconf to work on this and possibly extend it to other categories of Wikipedia "metadata" boxes.