OpenPecha / Toolkit

🛠 Tools to create, edit and export texts and annotations
https://toolkit.openpecha.org
Apache License 2.0
7 stars 4 forks source link

NorbuKetaka batch2 #241

Closed eroux closed 6 months ago

eroux commented 1 year ago

This is a follow-up of

https://github.com/OpenPecha/Toolkit/issues/208

I've uploaded the files of the new (and likely final) batch on

s3://ocr.bdrc.io/NorbuKetaka2/

let's have batch-0001 for the batch id, and the following info.json

{
   "timestamp": "2023-02-01T00:00:00Z"
}
10zintopjor commented 1 year ago

@eroux Do you mean to move those csv files in relevant s3 folder adn create opf?

eroux commented 1 year ago

Yes

10zintopjor commented 1 year ago

Hey can u checkout the sample opf.In the software_id in meta should it be norbuketaka2 or as before?

eroux commented 1 year ago

thanks! software_id should be as before, I'll look at the sample in a moment

eroux commented 1 year ago

oh sorry I realize I forgot to change the batch_id in my initial comment (my bad), it should be batch-0002

eroux commented 1 year ago

let's have last_modified set to 2023-02-01T00:00:00Z, but other than that it looks good, thanks!

10zintopjor commented 1 year ago

ok then i have to reimport the files to s3

eroux commented 1 year ago

oh, never mind then, having batch-0001 is not a big deal, we can live with that

10zintopjor commented 1 year ago

Hey I have updated the catalog over here.But for the file Works/cf/W1PD133164/norbuketaka2/batch-0002/W1PD133164-I4PD2795.csv the work id W1PD133164 does not have image group id I4PD2795.

eroux commented 1 year ago

thanks a lot!

It appears that in that case W1PD133164 should be instead W1PD133161 so the file should be merged with W1PD133161-I4PD2795.csv. Are there other cases like this?

Also, the point of software_id is to indicate the s3 folder, and it should be norbuketaka, not norbuketaka2, so please move the files on s3 (the opf files look good)

I'll go ahead and import the opf files, I'll tell you if I run in any trouble

10zintopjor commented 1 year ago

No that file is the only issue in batch-0002.

eroux commented 1 year ago

great!