Closed Plagiatus closed 1 year ago
All data in the file is taken directly from the wiki at https://chatwars-wiki.de/ so any errors in the data should be fixed there. I think the proper thing to do would be to remove one of the pages, but you may want to ask in Wiki Telegram group first at https://t.me/joinchat/AaDbq05AZVnNRqU0nojCWg. You can check https://chatwars-wiki.de/index.php?title=Contribute_to_the_CW-Wiki for more details.
You're saying it's taken directly, but as a matter of fact the page that the first entry links to (https://chatwars-wiki.de/index.php?title=Indissoluble_Apron) doesn't exist (anymore).
So I'm guessing there was a page there at some point, then it was removed/moved/etc but remained in the scraped data.
I guess you're right. I haven't touched this code in years now, but it does some stuff caching previous results to avoid hammering the wiki server. I guess that doesn't pick up on deletions.
I have just regenerated the entire file from scratch, and the duplicate seems to be gone now.
This might be two different issues, but here goes.
I'm using your repository for some of my own stuff (generally great work btw), and I've come across an issue:
Data in the
resources_v2.json
file is duplicated and partially out of date (in fact it's an issue in both v1 and v2, but I guess v1 isn't as important anymore).The item I noticed it with is
a118
- it exists twice.First entry
second entry
As you can see, the first entry contains outdated data - calling the item "Indissoluble Apron" in everything but the
"name"
field - including referring to the Apron recipes as ingredients.A local fix for others running into this issue could be to sort the data by
lastModified
orrevision
and only use the newer entry, at least that's what let me work around the issue for now.