openzim / zim-requests

Want a new ZIM file? Propose ZIM content improvements or fixes? Here you are!
https://farm.openzim.org
42 stars 3 forks source link

thalesdoc_en_all is failing #1214

Open kelson42 opened 5 days ago

kelson42 commented 5 days ago

Recipe URL

https://farm.openzim.org/recipes/thalesdoc_en_all/

Last log lines

[zimit::2024-11-22 06:34:07,976] INFO:----------
[zimit::2024-11-22 06:34:07,976] INFO:Processing WARC files in/at /output/.tmpsx6dc3yt/collections/crawl-20241116163210663/archive
[zimit::2024-11-22 06:34:07,976] INFO:Calling warc2zim with these args: ['--name=thalesdoc_en_all', '--tags=HSM CipherTrust', '--favicon=https://drive.farm.openzim.org/thalesdoc_en_all/favicon.png', '--zim-file=thalesdoc_en_all_{period}.zim', '--publisher=openZIM', '--scraper-suffix', 'zimit 2.1.6', '--output', '/output', '--url', 'https://thalesdocs.com/', '--custom-css', 'https://drive.farm.openzim.org/thalesdoc_en_all/custom.css', '--title', 'Thales CPL Documentation Hub', '--description', 'Home to all your Cloud Protection and Licensing product documentation needs', '--lang', 'eng', '-v', '--progress-file', '/output/warc2zim.json', '/output/.tmpsx6dc3yt/collections/crawl-20241116163210663/archive']
[warc2zim::2024-11-22 06:34:07,978] DEBUG:Attempting to confirm output is writable in directory /output
[warc2zim::2024-11-22 06:34:07,978] DEBUG:Output is writable. Temporary file used for test: /output/tmp2k9xk5h_
[warc2zim::2024-11-22 06:34:07,978] DEBUG:Confirming ZIM file can be created using name: thalesdoc_en_all_2024-11.zim
[warc2zim::2024-11-22 06:34:07,979] DEBUG:4 WARC files found
[warc2zim::2024-11-22 06:34:07,999] DEBUG:Title: Thales CPL Documentation Hub
[warc2zim::2024-11-22 06:34:07,999] DEBUG:Language: eng
[warc2zim::2024-11-22 06:34:07,999] DEBUG:Favicons to consider: https://drive.farm.openzim.org/thalesdoc_en_all/favicon.png
[warc2zim::2024-11-22 06:34:08,020] ERROR:Main URL returned an unprocessable HTTP code: 403
[zimit::2024-11-22 06:34:08,021] INFO:
[zimit::2024-11-22 06:34:08,021] INFO:
[zimit::2024-11-22 06:34:08,021] INFO:SIGINT/SIGTERM received, stopping zimit
[zimit::2024-11-22 06:34:08,021] INFO:
[zimit::2024-11-22 06:34:08,021] INFO:

How many times the recipe failed in a row?

Once

How many ZIM have been produced before failure?

Many

Which action did you undertake so far?

None, I have no idea of what to do

What's next?

I don't know

More details

Really late "crash" for a very unclear reason. Maybe a bug, if not the message would benefit to be clearer IMHO.

benoit74 commented 1 day ago

Upstream issue: https://github.com/openzim/warc2zim/issues/424