Closed nidame closed 1 year ago
Hello! Thank you very much!
It looks pretty good to me :)
@ponteineptique will we be able to use HUMG on this dataset?
If it's PageXML, yes absolutely :)
It's alto xml but I can also export page xml from the Transkribus website, if necessary
ALTO XML is even better for HUMG :)
(Next time I'll read the proposed record before commenting)
:-))
@PonteIneptique @alix-tz Hi, I just wanted to ask if you could include the metadata of the Devanagari GT in the HTR-United catalogue. Couldn't find it when searching. And I've got new data - GT for the South Indian script Malayalam provided by Tuebingen University Library. Would you be interested in that as well? If yes, I'll write a new issue. Best wishes Nicole
Hello, I just checked the content of the dataset in Mayalam script and it looks good so yes, it would be really interesting to add it. Can you make another issue for it?
Just a note: importing the Page is eScriptorium works, but not the ALTO (because of 1 missing information in the file exported by Transkribus), so can you make sure to keep the Page version in the dataset ?
Before I start a new issue, could you please kindly give me any information on the Devanagari dataset I submitted in November?
I think this issue can be closed, the remaining discussion about the second dataset will be in #104
Hello ! I'd like to include the metadata for my GT dataset on HTR United. The Alto XML files and the images are archived FID4SA@heiDATA, the research data repository of Heidelberg University. DOI to the dataset is included in the metadata. Hope it works! Please get in touch in case there are any questions. Best wishes, Nicole
Here is our dataset YAML file: