Closed Zachu closed 2 years ago
I don't know if someone would like to verify the data additions somehow before merging
Just an fyi, but because this repository is relied on by multiple sources, I am going to have to verify everything before I initiate a merge, as I don’t want to break any pre-existing tools, apps, etc.
On Apr 5, 2022, at 7:14 AM, Jani Korhonen @.***> wrote:
> I don't know if someone would like to verify the data additions somehow before merging
> Just an fyi, but because this repository is relied on by multiple sources, I am going to have to verify everything before I initiate a merge, as I don’t want to break any pre-existing tools, apps, etc.
I would not expect anything less =) What I meant was rather whether someone else wants to verify the new additions, for example to check that the OCR read the values correctly and that sort of thing.
There were some problems with the less common entries, like the card levels A/B (which I now read from the filenames, since they happen to be there) and the level P, whatever that means :sweat_smile:
Also, I decided to take just the number part of the card names, since some have a prefix such as "AA-". This limits the character space the Tesseract model needed to learn. If the prefixes need to be added, I would probably script it rather than try to read them with OCR.
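To illustrate the idea of keeping only the number part and scripting the prefix back in later, here is a minimal sketch. The name pattern (an optional alphabetic prefix, a hyphen, then digits) and the helper names `split_card_name` and `restore_prefix` are assumptions made for illustration; they are not taken from the scripts in this pull request.

```python
import re

# Assumed (hypothetical) card name shape: optional letters + "-", then digits,
# e.g. "AA-012" or "345". The real data may use other shapes.
_CARD_NAME = re.compile(r"^(?:(?P<prefix>[A-Za-z]+)-)?(?P<number>\d+)$")


def split_card_name(name):
    """Return (prefix, number); prefix is None when the name has no prefix."""
    match = _CARD_NAME.match(name.strip())
    if match is None:
        raise ValueError(f"unrecognized card name: {name!r}")
    return match.group("prefix"), match.group("number")


def restore_prefix(number, prefix):
    """Re-attach a prefix by script instead of reading it with OCR."""
    return f"{prefix}-{number}" if prefix else number
```

With this split, OCR only ever has to recognize digits, and a prefix that is already known from another source (such as the filename) can be joined back on afterwards.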
There is no need to process this pull request, as the requested data was added via another process.
Discussion in https://github.com/any2cards/worldhaven/issues/3
Since the values are read with OCR, there might be some mistakes. These should be fixed by adding them to `scripts/tesseract/manual-fixes-character-ability-cards.jsonl` and then merging them into `data/character-ability-cards.js` by using `scripts/merge_jsonl_to_json.sh`.
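The manual-fix workflow can be pictured roughly as follows. This is only a sketch of the general technique (overlaying JSONL corrections onto existing records), not the behavior of `scripts/merge_jsonl_to_json.sh`, which is not shown in this thread. The function name `merge_fixes`, the `name` key used for matching, and the record fields in the usage example are all assumptions.

```python
import json


def merge_fixes(records, fixes_jsonl, key="name"):
    """Overlay JSONL corrections onto records that share the same key value.

    `key` is a hypothetical identifying field; the real data may use another.
    """
    fixes = {}
    for line in fixes_jsonl.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the fixes file
        fix = json.loads(line)
        # Later fixes for the same key override earlier ones field by field.
        fixes.setdefault(fix[key], {}).update(fix)

    # Fields from a matching fix replace the OCR-derived values.
    return [{**record, **fixes.get(record.get(key), {})} for record in records]
```

For example, with hypothetical records where OCR misread a level:

```python
records = [{"name": "012", "level": "1"}, {"name": "345", "level": "F"}]
fixes = '{"name": "345", "level": "P"}\n'
merged = merge_fixes(records, fixes)
```

Keeping corrections in a separate JSONL file means the OCR step can be rerun at any time without losing the manual fixes.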