The task is to export a MediaWiki to HTML.
There is the extension DumpHTML, but it is unmaintained: https://www.mediawiki.org/wiki/Extension%3aDumpHTML
This Python script supports the following features:
You need to use a bot password, to make the script work, see [[Special:BotPasswords]] / https://www.mediawiki.org/wiki/Manual:Bot_passwords
git clone https://github.com/SolidCharity/exportMediaWiki2HTML.git
cd exportMediaWiki2HTML
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
For all commands, you need to activate the virtual environment first:
cd exportMediaWiki2HTML
source .venv/bin/activate
Please pass the url of the wiki
python3 exportMediaWiki2Html.py --url https://mywiki.example.org
Optionally pass the page id of the page you want to download, eg. for debugging:
python3 exportMediaWiki2Html.py --url https://mywiki.example.org --page 180
Optionally pass the name of a Bot and the Bot password (create a Bot at https://wiki.example.org/index.php?title=Spezial:BotPasswords):
python3 exportMediaWiki2Html.py --url https://mywiki.example.org --user "myuser@botname" --password "botpwd" [pageid]
You can use --help
to see all options.
Feel free to file any issues, and Pull Requests are welcome as well!