openzim / sotoki

StackExchange websites to ZIM scraper
https://library.kiwix.org/?category=stack_exchange
GNU General Public License v3.0

Do not create ZIM with `eng,eng` language metadata #321

Open benoit74 opened 1 week ago

benoit74 commented 1 week ago

See https://library.kiwix.org/raw/english.stackexchange.com_mul_all_2024-05/meta/Language

Analysis: https://github.com/openzim/sotoki/issues/259 requested enhancing the metadata. The implementation done in https://github.com/openzim/sotoki/commit/26716426acb9107be017b306e354188241c11584 is, however, a bit too naive: it creates a silly `eng,eng` metadata value when dealing with the English Stack Exchange site.
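For illustration only, here is a minimal sketch of one way to avoid the duplicated value: drop repeated codes while keeping their order before joining them. The function name `build_language_metadata` and its input list are hypothetical and are not taken from the sotoki code base; the actual fix may look quite different.

```python
def build_language_metadata(codes):
    """Join language codes with commas, dropping duplicates.

    Hypothetical helper, not sotoki's actual code. `codes` is assumed to be
    a list of ISO 639-3 codes such as ["eng", "eng"].
    """
    # dict.fromkeys keeps the first occurrence of each code, in order.
    return ",".join(dict.fromkeys(codes))


print(build_language_metadata(["eng", "eng"]))  # eng
print(build_language_metadata(["fra", "eng"]))  # fra,eng
```

Order is preserved so that whichever code is listed first stays first in the resulting value.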

We must probably:

Note: beware that the second point implies renaming a book, which has to be handled through the book renaming procedure to avoid issues (duplicate books) in the library and imager service.