junxnone / aiwiki

AI Wiki
https://junxnone.github.io/aiwiki

ML Tasks Image OCR TextScanner #192

Open junxnone opened 4 years ago

junxnone commented 4 years ago

Reference

Brief

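| Model | Disadvantages | Advantages |
| --- | --- | --- |
| RNN | Attention drift | |
| Segmentation | A single character may be split into several parts; adjacent characters may stick together | Recognizes irregular fonts |

| Branch | Output | Description |
| --- | --- | --- |
| Class Branch | Character Segmentation Maps | Character segmentation mask |
| Geometry Branch | Localization Maps | Character positions and order |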

Attention Decoder vs Segmentation-based vs TextScanner

image

Arch

image

Class Branch

Geometry Branch

image

Mutual-Supervision

image

Word Formation

classification scores image
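
The scoring step can be illustrated with a minimal sketch. It assumes that the score of class c for the k-th character is the sum over all pixels of the character segmentation map for class c multiplied by the k-th order map, and that the predicted character is the arg-max class. The function name `word_formation`, the nested-list layout, and the convention that class index 0 means background are all hypothetical choices made for this example, not taken from the issue.

```python
def word_formation(seg_maps, order_maps, charset):
    """Decode a string from segmentation maps and order maps.

    seg_maps:   list of C maps (each H x W), per-pixel class probabilities;
                index 0 is assumed to be the background class.
    order_maps: list of N maps (each H x W); the k-th map is assumed to
                highlight the pixels of the k-th character.
    charset:    string whose i-th symbol corresponds to class index i + 1.
    """
    text = []
    for order_map in order_maps:
        # Score every class: pixel-wise product of the two maps, summed.
        scores = []
        for class_map in seg_maps:
            score = sum(
                g * h
                for g_row, h_row in zip(class_map, order_map)
                for g, h in zip(g_row, h_row)
            )
            scores.append(score)
        best = max(range(len(scores)), key=scores.__getitem__)
        # Skip positions where background wins (assumption of this sketch).
        if best != 0:
            text.append(charset[best - 1])
    return "".join(text)


# Toy 1 x 2 "image": left pixel looks like "a", right pixel like "b".
seg_maps = [
    [[0.1, 0.1]],  # background
    [[0.8, 0.1]],  # "a"
    [[0.1, 0.8]],  # "b"
]
order_maps = [
    [[1.0, 0.0]],  # first character sits on the left pixel
    [[0.0, 1.0]],  # second character sits on the right pixel
    [[0.0, 0.0]],  # no third character
]
print(word_formation(seg_maps, order_maps, "ab"))
```

Because the reading order comes from the order maps and not from a recurrent decoder, each character is scored independently of the previous one, which is how this kind of design sidesteps the attention drift listed in the Brief table.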

Training

image

Loss

image

Mutual supervision Loss image
image

λ = 0.2, γ = 2

Test

English Dataset Test

image

Chinese Dataset Test

image

junxnone commented 4 years ago

junxnone/tech-io#749