riceissa / project-ideas

Project ideas for Issa Rice (see repo issues)
The Unlicense

Better conversion from MediaWiki to Markdown #26

Open riceissa opened 6 years ago

riceissa commented 6 years ago

I think Pandoc is the best converter at the moment, but it doesn't handle templates, which are used a lot in MediaWiki documents. Writing a Pandoc filter to handle templates might give decent output. Alternatively, converting from MediaWiki to some intermediate format and then to Markdown might work.
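One way to try the template idea without touching Pandoc itself is a text pre-pass that expands templates before the wikitext reaches the converter. The sketch below is only an illustration under that assumption: `expand_templates`, `parse_template`, and the `handlers` mapping are hypothetical names, and the `Main` handler is a made-up example. It does not cope with `|` inside wikilinks or with parser functions.

```python
import re

# Matches an innermost template: {{ ... }} with no braces inside.
TEMPLATE = re.compile(r"\{\{([^{}]*)\}\}")


def parse_template(inner):
    """Split 'Name|arg|key=value' into (name, positional, named)."""
    parts = [p.strip() for p in inner.split("|")]
    name, positional, named = parts[0], [], {}
    for part in parts[1:]:
        if "=" in part:
            key, value = part.split("=", 1)
            named[key.strip()] = value.strip()
        else:
            positional.append(part)
    return name, positional, named


def expand_templates(wikitext, handlers):
    """Replace known templates using handlers; leave unknown ones as-is.

    handlers maps a lowercase template name to a function taking
    (positional, named) and returning replacement wikitext.
    """
    def replace(match):
        name, positional, named = parse_template(match.group(1))
        handler = handlers.get(name.lower())
        if handler is None:
            return match.group(0)  # unknown template: keep the original text
        return handler(positional, named)

    # Repeat until nothing changes so that nested templates are expanded
    # from the inside out. A handler that emits its own template again
    # would loop forever; this sketch assumes handlers do not do that.
    previous = None
    while previous != wikitext:
        previous = wikitext
        wikitext = TEMPLATE.sub(replace, wikitext)
    return wikitext


# Hypothetical handler for a "Main" template.
handlers = {"main": lambda pos, named: "Main article: [[{}]]".format(pos[0])}
result = expand_templates("See {{Main|Pandoc}}.", handlers)
```

The expanded text can then be passed to Pandoc as ordinary MediaWiki input. An unknown template nested inside a known one blocks the outer expansion, since the outer match then contains braces.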

I made some progress on this at https://github.com/riceissa/pandocfilters-examples/blob/master/mediawiki_refs.lua, but Pandoc does not parse duplicate ref tags (ones that reuse an earlier reference via the tag's name attribute), so you get a lot of blank footnotes.
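A possible workaround for the blank footnotes, sketched here as an assumption rather than a tested fix, is to rewrite the wikitext before Pandoc sees it so that every reuse such as `<ref name="foo" />` carries a copy of the text from the defining `<ref name="foo">...</ref>`. The function name `expand_named_refs` is hypothetical, and the regular expressions only cover refs whose sole attribute is `name`.

```python
import re

# A defining ref: <ref name="foo">content</ref>
DEFINING_REF = re.compile(
    r'<ref\s+name\s*=\s*"?([^">/]+?)"?\s*>(.*?)</ref>',
    re.DOTALL | re.IGNORECASE,
)
# A self-closing reuse: <ref name="foo" />
REUSED_REF = re.compile(
    r'<ref\s+name\s*=\s*"?([^">/]+?)"?\s*/>',
    re.IGNORECASE,
)


def expand_named_refs(wikitext):
    """Replace each reused named ref with a plain ref holding the same text."""
    # Collect the body of every defining ref, keyed by its name.
    bodies = {
        m.group(1).strip(): m.group(2)
        for m in DEFINING_REF.finditer(wikitext)
    }

    def replace(match):
        name = match.group(1).strip()
        if name not in bodies:
            return match.group(0)  # no definition found: leave untouched
        # Emit an unnamed ref so the converter sees an ordinary footnote.
        return "<ref>{}</ref>".format(bodies[name])

    return REUSED_REF.sub(replace, wikitext)


sample = 'A<ref name="x">Src</ref> B<ref name="x" /> C<ref name="y"/>'
expanded = expand_named_refs(sample)
```

The trade-off is that a reference cited several times becomes several identical footnotes in the Markdown output instead of one shared footnote.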