OpenPecha / SynthImage

MIT License
0 stars 0 forks source link

OCR0048: Enhance Synthetic Page Images with Augmentation Libraries #16

Open jim-gyas opened 1 month ago

jim-gyas commented 1 month ago

Description:

Develop a Python script to apply augmentations to synthetic page images using various libraries. Users can select their preferred augmentations from a predefined list. The script will enhance the existing synthetic page images by applying the selected augmentations. The main goal is to create diverse and realistic training images to improve the OCR dataset.

Resources:

1)Synthetic Page Image: https://github.com/OpenPecha/SynthImage 2)Augmentation Libraries:

List of Augmentation using Augraphy:

Implementation:

Screenshot 2024-08-09 at 4 50 15 AM

Sub Task

jim-gyas commented 1 month ago

Sythetic Page Image

Image

Albumentation Augmentation:

Perspective Augmentation

Image

Median Blur Augmentation

Image

Shadow Augmentation

Image

Sun Flare

Image

jim-gyas commented 1 month ago

Original Image:

Image

Augraphy Albumentation Example:

Image Image Image Image

kaldan007 commented 1 month ago

@jim-gyas I would rather make each augmentation a separate module like you have done with distrotion and deformation. selection of with augmentation is common to all augmentation. Then why do u have that selection feature only for those two libraries augmentation. I remember u have a randomization feature implemented in a module.

jim-gyas commented 1 month ago

Transpose Augmentation:

Image

Problem in Line Extraction for Transpose Augmentation:

Image Image Image

After Solving the problem:

Image Image Image Image
Image

jim-gyas commented 1 month ago

Input Synthetic Page Image:

Screenshot 2024-08-09 at 11 01 41 AM

Augraphy Augmentation:

Bad Photo Copy Augmentation:

Screenshot 2024-08-09 at 10 53 12 AM

Dirty Rollers Augmentation:

Screenshot 2024-08-09 at 10 53 24 AM

Ink Bleed Augmentation:

Screenshot 2024-08-09 at 10 53 35 AM

Scribble Augmentation:

Screenshot 2024-08-09 at 10 54 06 AM

Page Texture Augmentation:

Screenshot 2024-08-09 at 10 53 51 AM