OCR0044 Glyph Baseline Marking AI Model

Description: This card explains in detail the U-Net model we built to mark glyph baselines.

Background: Our OCR team is currently experimenting with scripture font creation. For this, we run Google OCR (OCR0021) on images of Tibetan publications such as Derge, Peking, etc. Each result is an image of the target character along with extra letters on the sides. We give these images to annotators to:

crop out the target glyph
draw the baseline

Since this job follows a specific pattern (drawing a box on the baseline), AI models can perform well on it even with very little data.
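To make the annotation target concrete, here is a minimal sketch of how a baseline box could be turned into a binary mask, the kind of per-pixel target a segmentation model is usually trained on. The `BaselineBox` record, its field names, and the axis-aligned pixel-box format are assumptions made for illustration; the card does not specify how annotations are stored.

```python
from dataclasses import dataclass


@dataclass
class BaselineBox:
    """Hypothetical annotation record: an axis-aligned box in pixel
    coordinates of the cropped glyph image (right/bottom are exclusive)."""
    left: int
    top: int
    right: int
    bottom: int


def box_to_mask(box, width, height):
    """Rasterize the box into a height x width grid of 0/1 values."""
    mask = [[0] * width for _ in range(height)]
    # Clamp the box to the image so out-of-range annotations cannot crash.
    for y in range(max(0, box.top), min(height, box.bottom)):
        for x in range(max(0, box.left), min(width, box.right)):
            mask[y][x] = 1
    return mask


# Toy example: an 8x6 image with a 2-pixel-tall baseline box.
mask = box_to_mask(BaselineBox(left=1, top=1, right=7, bottom=3), width=8, height=6)
for row in mask:
    print("".join("#" if v else "." for v in row))
```

Storing the box rather than the mask keeps annotations small, and the mask can be regenerated at whatever resolution the model is trained on.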

Diagram: The diagram below explains the process in its simplest form. It shows all the stages involved: dataset creation, training, and inference.

Model: We use a simple conditional U-Net model for this task. The model has 31 million parameters.
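As a sanity check on the parameter count, the sketch below counts the weights of a textbook U-Net: two 3x3 convolutions per level, channel widths 64 to 1024, 2x2 transposed convolutions for upsampling, and a final 1x1 convolution. The widths, the single input and output channel, and the absence of normalization or conditioning layers are all assumptions; the card does not describe the exact architecture. Under these assumptions the total comes out at roughly 31 million, which is consistent with the figure above.

```python
def conv_params(c_in, c_out, kernel):
    """Weights plus biases of one 2-D (transposed) convolution with a square kernel."""
    return kernel * kernel * c_in * c_out + c_out


def unet_params(in_channels=1, out_channels=1, widths=(64, 128, 256, 512, 1024)):
    """Parameter count of a textbook U-Net (assumed layout, not the team's exact model)."""
    total = 0
    # Encoder and bottleneck: two 3x3 convolutions per level.
    c_prev = in_channels
    for w in widths:
        total += conv_params(c_prev, w, 3) + conv_params(w, w, 3)
        c_prev = w
    # Decoder: a 2x2 transposed convolution, then two 3x3 convolutions applied
    # to the upsampled features concatenated with the skip connection.
    for w in reversed(widths[:-1]):
        total += conv_params(c_prev, w, 2)
        total += conv_params(2 * w, w, 3) + conv_params(w, w, 3)
        c_prev = w
    # Final 1x1 convolution that maps to the output channels.
    total += conv_params(c_prev, out_channels, 1)
    return total


n_params = unet_params()
print(f"{n_params / 1e6:.1f}M parameters")
```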

Case Study: For demonstration, we trained this model on the Derge dataset. We have a total of 23,000 Derge glyph images whose baselines have been marked by annotators. Training on a dataset of only 3k images for only 50 epochs (10 minutes), we achieve almost 100% accuracy, judged by visual inspection of the model's predictions on the test data.
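The accuracy above was judged by eye. If a numeric score is wanted alongside visual inspection, one common choice is intersection over union (IoU) between the predicted and annotated baseline masks. The sketch below is only an illustration of that metric on toy masks; it is not a result from this case study, and the mask format (nested lists of 0/1) is an assumption.

```python
def iou(pred, target):
    """Intersection over union of two equally sized 0/1 masks."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    # Two empty masks agree perfectly, so score them as 1.0.
    return inter / union if union else 1.0


# Toy masks: the prediction misses one pixel of a 4-pixel baseline.
target_mask = [[0, 0, 0, 0],
               [1, 1, 1, 1],
               [0, 0, 0, 0]]
pred_mask = [[0, 0, 0, 0],
             [1, 1, 1, 0],
             [0, 0, 0, 0]]
print(iou(pred_mask, target_mask))
```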

Example outputs:

Potential: From now on, glyph baseline marking can be easily automated with this highly accurate and fast AI model, saving the company's resources.

The new pipeline will look like this:

Collect 1-2k samples of annotators' work
Train the model on this data
Use the trained model to mark glyph baselines for the rest of the dataset
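The first and last steps of the pipeline amount to splitting the dataset into a small part for annotators and a large part for the model. A minimal sketch of that split follows, assuming glyph images are tracked by string IDs; the function name, the ID format, and the fixed seed are hypothetical. The 23,000 total mirrors the Derge case study, and 2,000 is the upper end of the 1-2k range above.

```python
import random


def split_for_bootstrap(glyph_ids, n_annotated, seed=0):
    """Randomly pick the glyphs to annotate by hand; the rest go to the model."""
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    shuffled = list(glyph_ids)
    rng.shuffle(shuffled)
    return shuffled[:n_annotated], shuffled[n_annotated:]


# Hypothetical IDs standing in for the glyph images.
all_ids = [f"glyph_{i:05d}" for i in range(23_000)]
to_annotate, to_predict = split_for_bootstrap(all_ids, n_annotated=2_000)
print(len(to_annotate), len(to_predict))
```

Sampling at random, rather than taking the first images in file order, helps the annotated subset cover the same variety of glyphs as the portion the model will label.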

OpenPecha / Glyph-Baseline-Marking-AI-Model