minhhoai1001 / ppocr-vietnamese

Apache License 2.0
0 stars 0 forks source link

Optical Character Recognition (OCR) - Overview

OCR is a key field in computer vision. One typical scenario is the recognition of structured texts of particular areas, which is widely used in daily life, such as license plate recognition, bank card information recognition, ID card information recognition, train ticket information recognition, and so on.

Challenges in OCR

The two main method in OCR problems:

Text detection

Some popular text detection algorithms can be roughly divided into two categories:

Further

As OCR technology continues to prosper and the demand for structured information extraction is growing, various technologies concerning intelligent document analysis, like layout analysis, table recognition, and key information extraction, have gained increasing attention.

PaddleOCR - Vietnamese

Introduction

PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and apply them into practice.

Install Paddlepaddle-gpu

Install PaddleOCR-Vietnamese

git clone https://github.com/minhhoai1001/ppocr-vietnamese.git
pip install -r requirements.txt

Prepare dataset

Training detection text

Training Recognition text

Inference two model together

    python tools/infer/predict_system.py \
        --image_dir="./train_data/vietnamese/test_image/im1500.jpg" \
        --det_model_dir="./inference/det_r50_sast_vn/" \
        --det_algorithm="SAST" \
        --rec_image_shape="3, 48, 320" \
        --rec_model_dir="./inference/rec_ppocr_v3_vn/" \
        --rec_char_dict_path="./ppocr/utils/dict/vi_vietnam.txt" \
        --vis_font_path="doc/fonts/vietnam-light.ttf"