OpenPecha / prodigy-tools

Tools for OpenPecha's use of Prodigy
MIT License
0 stars 1 forks source link

interface for input and review of OCR for the PaganTibet project #87

Open eroux opened 1 year ago

eroux commented 1 year ago

The PaganTibet project needs a Prodigy interface to do two separate things:

The workflow is organized in batches, it will be performed by monks in Nepal. The file structure is:

batch_name/
      images/
           image1.jpg (here the filename can vary)
           ...
      page/
           image1.xml (the PAGE file corresponding to images/image1.jpg)
           ...

The files are in s3://image-processing.bdrc.io/PT/. The deadline for a first draft of the interface is Monday, then the project will train the monks in Nepal to do input and reviewing