OpenPecha / find-complex-ligature

MIT License

OCR0036: Find remaining glyphs of complex ligatures #1

Open 10kalden opened 1 month ago

10kalden commented 1 month ago

Description: Around 300 glyphs from the OPF text and about 100 glyphs of complex ligatures are yet to be found. We have several approaches for extracting all the required glyphs.

Implementation: We have developed two approaches for this:

  1. Use the individual glyphs already obtained, stack them according to the ligature, and apply morphing and augmentation to get the best possible result.
  2. Have annotators draw the missing glyphs as close as possible to the glyphs in the publication, then scan the drawings.
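
As a rough illustration of approach 1, here is a minimal sketch of stacking two glyph bitmaps and applying one simple morphological step. It assumes glyphs are small binary bitmaps (lists of rows, 1 = ink); the names `stack_glyphs`, `dilate`, and the `overlap` parameter are hypothetical and not taken from this repository. Real glyph images would more likely go through an image library, and the augmentation step is not shown.

```python
from typing import List

Bitmap = List[List[int]]  # rows of 0/1 pixels, 1 = ink (assumed representation)


def pad_to_width(glyph: Bitmap, width: int) -> Bitmap:
    """Center a glyph horizontally on a blank canvas of the given width."""
    out = []
    for row in glyph:
        extra = width - len(row)
        left = extra // 2
        out.append([0] * left + row + [0] * (extra - left))
    return out


def stack_glyphs(top: Bitmap, bottom: Bitmap, overlap: int = 0) -> Bitmap:
    """Place `bottom` under `top`, merging `overlap` rows with a pixel-wise OR."""
    width = max(len(top[0]), len(bottom[0]))
    top, bottom = pad_to_width(top, width), pad_to_width(bottom, width)
    overlap = max(0, min(overlap, len(top), len(bottom)))
    merged = [
        [a | b for a, b in zip(t_row, b_row)]
        for t_row, b_row in zip(top[len(top) - overlap:], bottom[:overlap])
    ]
    return top[:len(top) - overlap] + merged + bottom[overlap:]


def dilate(img: Bitmap) -> Bitmap:
    """Thicken strokes: a pixel becomes ink if it or a 4-neighbour is ink."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dy, dx in ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if 0 <= ny < h and 0 <= nx < w and img[ny][nx]:
                    out[y][x] = 1
                    break
    return out
```

Calling `dilate(stack_glyphs(head, subjoined, overlap=1))` would join a head glyph and a subjoined glyph with a one-row overlap and then thicken the strokes slightly. The right overlap per ligature would have to be tuned by eye against the publication.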

Subtask:

Completion Criteria: To obtain the remaining missing glyphs

kaldan007 commented 1 month ago

Please reach out to Gen Lopa about the 100 missing glyphs.

10kalden commented 1 month ago

Image

kaldan007 commented 1 month ago

Please update the card with the new approach.

10kalden commented 1 month ago

Ligatures obtained by joining glyphs:

Image

Image

Image

kaldan007 commented 1 month ago

Kindly scan https://github.com/OpenPecha-Data/P000002 for the missing glyphs. Use all the glyphs you found to generate a font.
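
A scan like this could be sketched as below. This is only an assumption about how the search might be done: it treats the pecha's text as plain UTF-8 `.txt` files under some directory and looks for each missing ligature as a substring. The function names `find_glyphs` and `still_missing` and the directory argument are hypothetical, not part of any OpenPecha tooling.

```python
from pathlib import Path
from typing import Dict, Iterable, List, Tuple

Hits = Dict[str, List[Tuple[str, int]]]


def find_glyphs(base_dir: str, targets: Iterable[str]) -> Hits:
    """Map each target string to (file name, character offset) pairs where it occurs."""
    hits: Hits = {t: [] for t in targets}
    for path in sorted(Path(base_dir).rglob("*.txt")):
        text = path.read_text(encoding="utf-8")
        for target in hits:
            start = text.find(target)
            while start != -1:
                hits[target].append((path.name, start))
                start = text.find(target, start + 1)
    return hits


def still_missing(hits: Hits) -> List[str]:
    """Return the targets that were not found in any file."""
    return sorted(t for t, locations in hits.items() if not locations)
```

Note that a plain substring match also reports a target that is only the upper part of a taller stack, so each hit would still need to be checked against the page image before cropping a glyph from it.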

10kalden commented 1 month ago

The OPF data for P000002 is incomplete: there are no image references showing where each character occurs.

Image

kaldan007 commented 1 month ago

Please check the commit history of the OPF.

10kalden commented 1 month ago

Found 73 glyphs from the Derge Tengyur.

kaldan007 commented 1 month ago

We are left with 23 missing glyphs.