openzim / mwoffliner

Mediawiki scraper: all your wiki articles in one highly compressed ZIM file
https://www.npmjs.com/package/mwoffliner
GNU General Public License v3.0
288 stars 73 forks source link

Error downloading article * #1970

Open Dimas-sc opened 10 months ago

Dimas-sc commented 10 months ago

I'm using mwoffliner with the docker image ghcr.io/openzim/mwoffliner:latest. I want to make a Zim file for a local mediawiki installation, version 1.35.0. I run it with this command:

docker run --volume=//$(pwd)/out:/out -ti --rm ghcr.io/openzim/mwoffliner mwoffliner --mwUrl http://localserver/wikiutic --adminEmail mymail@domain.cat --mwWikiPath /wikiutic --mwApiPath /wikiutic/api.php --mwModulePath /wikiutic/load.php --verbose warn

But it fails every time, with errors like this (each time, different article name):

starting redis-server in the background…
(node:13) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023.

Please migrate your code to use AWS SDK for JavaScript (v3).
For more information, check the migration guide at https://a.co/7PzMCcy
(Use `node --trace-warnings ...` to show where the warning was created)
[warn] [2024-01-08T15:20:54.919Z] Got warning from MW Query {
        "query": {
                "*": "Unrecognized value for parameter \"prop\": pageimages"
        }
}
[warn] [2024-01-08T15:20:55.183Z] Article Id is invalid - expected a string but got [], converting to string and continuing
[warn] [2024-01-08T15:20:55.402Z] Couldn't find strings file for [ca], falling back to [en]
[warn] [2024-01-08T15:20:55.788Z] Failed to get CSS from [http://localserver/wikiutic/load.php?lang=ca&modules=ext.visualEditor.desktopArticleTarget.noscript%7Cmediawiki.toc.styles%7Cskins.vector.styles.legacy&only=styles&skin=vector]
[error] [2024-01-08T15:20:55.954Z] Error downloading article Manometries
[error] [2024-01-08T15:20:55.955Z] Failed to run mwoffliner after [3s]: {}
Dimas-sc commented 10 months ago

I tried disabling VisualEditor, and now I'm getting this error:

Unable to find appropriate API end-point to retrieve article HTML

Dimas-sc commented 10 months ago

I tried width :dev image and I got similar errors:

docker run --volume=//$(pwd)/out:/out -ti --rm ghcr.io/openzim/mwoffliner:dev mwoffliner --mwUrl http://localServer --adminEmail mail@domain.cat --mwWikiPath /wikiutic --mwActionApiPath /wikiutic/api.php --mwModulePath /wikiutic/load.php --verbose warn
starting redis-server in the background…
[warn] [2024-01-08T15:58:38.357Z] Got warning from MW Query {
        "main": {
                "warnings": "Unrecognized parameter: colimit."
        },
        "query": {
                "warnings": "Unrecognized values for parameter \"prop\": pageimages, coordinates"
        }
}
[warn] [2024-01-08T15:58:38.608Z] Got warning from MW Query {
        "main": {
                "warnings": "Unrecognized parameter: colimit."
        },
        "query": {
                "warnings": "Unrecognized value for parameter \"prop\": coordinates"
        }
}
[warn] [2024-01-08T15:58:38.609Z] Article Id is invalid - expected a string but got [], converting to string and continuing
[warn] [2024-01-08T15:58:38.909Z] Got warning from MW Query {
        "main": {
                "warnings": "Unrecognized parameter: colimit."
        },
        "query": {
                "warnings": "Unrecognized value for parameter \"prop\": coordinates"
        }
}
[warn] [2024-01-08T15:58:38.920Z] Couldn't find strings file for [ca], falling back to [en]
[warn] [2024-01-08T15:58:39.401Z] Failed to get CSS from [http://localServer/wikiutic/load.php?lang=ca&modules=ext.visualEditor.desktopArticleTarget.noscript%7Cmediawiki.toc.styles%7Cskins.vector.styles.legacy&only=styles&skin=vector]
[error] [2024-01-08T15:58:41.159Z] Unable to retrieve js/css dependencies for article '[object Object]': missingtitle
[error] [2024-01-08T15:58:41.573Z] Article has been deleted.
[error] [2024-01-08T15:58:41.574Z] Article has been deleted.
[error] [2024-01-08T15:58:41.574Z] Error downloading article index.php?title=Pàgina_principal
[error] [2024-01-08T15:58:45.853Z] Error downloading article Generació/Regeneració_d'agendes
[error] [2024-01-08T15:58:45.854Z] Failed to run mwoffliner after [10s]: {}