the-fab-cube / flesh-and-blood-cards

Open source JSON/CSV representations of the cards for the Flesh and Blood TCG

History Pack 1 Black Label Data Entry #118

Open luceleaftea opened 2 years ago

luceleaftea commented 2 years ago

This issue will serve as the staging ground for the History Pack 1 Black Label data entry effort.

Please leave a comment here if you have interest in assisting with transcribing data, and which languages you are interested in transcribing for. When I have the repo ready for this effort, I will update this comment with more instructions and contact those who are interested!

manwaring commented 2 years ago

Here's the breakdown of work I've been thinking of - is this how you're thinking about it, too?

luceleaftea commented 2 years ago

I think you covered it all, yup!

manwaring commented 2 years ago

It's a lot! 😅

luceleaftea commented 2 years ago

It is 😅 I'll also need to update the SQL server script, now that I think about it. So it will take a few days, but I should hopefully have the repo ready soon! That will give us time to recruit volunteers.

luceleaftea commented 1 year ago

Started setting stuff up over on this branch.

I think the remaining things needing done are adding .ods and .csv files for the files you have listed, and finishing updating the scripts. I'll work more on the scripts either tonight or tomorrow, but if you have time I would appreciate help setting up the rest of the files!

kirkbushell commented 1 year ago

I'm curious whether this is something you could partly automate by using OCR and telling it the language. All the text sits within certain coordinates, and if the image sizes are identical to the rest (as most are), this should be relatively straightforward to transcribe automatically, with some gaps.

I mention this because before I came across this repo, I was working on exactly that, as the LSS data is just too erroneous (as we all know - lol).
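
For anyone who wants to experiment with that idea, here is a minimal sketch, assuming the pytesseract wrapper and Pillow are installed along with the relevant Tesseract language data. The short language codes, the `ocr_region` helper, and the crop box values are hypothetical and not part of the repo; only the Tesseract language names (`eng`, `fra`, `deu`, `spa`) are standard.

```python
# Sketch only: OCR one fixed region of a card image in a chosen language.
# The short codes and helper names below are assumptions for illustration.
TESSERACT_LANGS = {
    "en": "eng",
    "fr": "fra",
    "de": "deu",
    "es": "spa",
}


def tesseract_lang(code: str) -> str:
    """Map a short language code (e.g. "FR") to a Tesseract language name."""
    return TESSERACT_LANGS[code.lower()]


def ocr_region(image_path: str, box: tuple, lang_code: str) -> str:
    """Crop `box` = (left, upper, right, lower) in pixels and OCR that crop."""
    # Imported lazily so the mapping above is usable without the OCR stack.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as image:
        cropped = image.crop(box)
        text = pytesseract.image_to_string(cropped, lang=tesseract_lang(lang_code))
    return text.strip()
```

A call might look like `ocr_region("FR_1HP204.png", (40, 30, 410, 70), "fr")`, where the box is a made-up guess at where the title sits and would need measuring on real card images.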

luceleaftea commented 1 year ago

I have not messed around with OCR enough to want to dedicate time to making an OCR script for the repo right now - for now I will leave the data to be hand entered and double checked, but if you have an OCR script you'd like to use or add to the repo to input the data, I have no issues with that being an available resource for people!

kirkbushell commented 1 year ago

Yeah that's fair. It's pretty easy to scan images using something like tesseract, but I hear you.

The best it could do would be the initial population of data; based on text coordinates, it would know whether a piece of text is a title, copyright info, card text, etc.

But it would still need input for things like card stats, such as pitch/attack, etc. (although this can certainly be done using machine learning tools, but then that would start to cost money, so...).
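
The coordinate idea could be sketched roughly as follows, assuming every card image shares one layout. The region names and the vertical bands are invented placeholders, not measured from real cards, so they would need tuning before being useful.

```python
# Sketch only: label a piece of OCR'd text by where it sits on the card.
# The bands below are hypothetical fractions of the image height.
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Region:
    name: str
    top: float     # inclusive, as a fraction of image height
    bottom: float  # exclusive, as a fraction of image height


REGIONS = (
    Region("title", 0.00, 0.12),
    Region("card_text", 0.60, 0.90),
    Region("copyright", 0.94, 1.00),
)


def classify(y_center: float, image_height: int) -> Optional[str]:
    """Return the region name containing y_center, or None if it fits nowhere."""
    fraction = y_center / image_height
    for region in REGIONS:
        if region.top <= fraction < region.bottom:
            return region.name
    return None
```

Using fractions rather than pixels keeps the bands valid if the images are later rescaled, and returning `None` leaves unmatched text (the "gaps") for a human to sort out.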

luceleaftea commented 1 year ago

Honestly the text is the hardest part for me personally (at least if you're counting the amount of text bugfixes in the repo history....), so something to get the initial text in would be pretty rad in the long term.

kirkbushell commented 1 year ago

It's something I began working on for FaB DB. I'll see if it'll work if we set the language and share :)

mstraa commented 1 year ago

Hi! I can look at the OCR script, but for me, the only things needed are the name and the "inner text". All the stats can be found from the card's original reference (EN: 1HP204: https://storage.googleapis.com/fabmaster/media/images/1HP204.width-450.png - FR: https://storage.googleapis.com/fabmaster/cardfaces/2022-1HP/FR/FR_1HP204.png ).

I'll try to do something when I have an evening to spare! :D
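
As a rough illustration of pairing a localized card with its English reference, the two URLs above suggest a pattern that could be generated from the card ID. This is only inferred from that single example pair: whether it holds for other cards, other languages, or other sets is an assumption, and the function names are hypothetical.

```python
# Sketch only: rebuild the two example URLs from a card ID.
# The URL patterns are inferred from one EN/FR pair and may not generalize.
BASE = "https://storage.googleapis.com/fabmaster"


def english_image_url(card_id: str) -> str:
    """URL of the English card image, following the EN example."""
    return f"{BASE}/media/images/{card_id}.width-450.png"


def localized_image_url(card_id: str, lang: str, set_folder: str = "2022-1HP") -> str:
    """URL of a localized card face, following the FR example."""
    lang = lang.upper()
    return f"{BASE}/cardfaces/{set_folder}/{lang}/{lang}_{card_id}.png"
```

With both URLs derived from the same ID, a script would only need to OCR the name and inner text from the localized image and copy the stats from the existing English entry.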

kirkbushell commented 1 year ago

That's a really good point about the card stats already being in place! Ez mode then! haha

The biggest challenge will be the icons in the card text.

CarlosGGFAB commented 1 year ago

Hello, my name is Carlos Gutiérrez and I would love to help with the Spanish translations.

Just-a-Human96 commented 1 year ago

Hi, my name is Tim. If I can help with the translations in any way, let me know. Since I am German, that's probably where I could help the most, but if I can help in any other way I would be happy to do so.

Mofte commented 1 year ago

Hey there! Not sure if needed, but I could occasionally help to input some German cards, especially with the upcoming HP2 and Outsiders cards there should be plenty of work. ^^ Timo

gre99ory commented 1 year ago

Hello everyone, my name is Gregory. I'm French and willing to help with French if needed. Just let me know!

luceleaftea commented 1 year ago

Hey there everyone, thank you so much for all of the offers! I'm still recovering from Outsiders spoiler season, but after I get some rest I'll finish up getting the repo ready for you all to help out 😄