Open chullhwan-song opened 5 years ago
Dataset | Training set | Test set | Val set | Language | Type |
---|---|---|---|---|---|
IC13 | 229 | 233 | | En | horizontal |
IC15 - Incidental Scene Text | 1000 | 500 | | En | Google Glass, quadrilaterals |
IC17 | 7,200 | 9,000 | 1,800 | multi-lingual | |
MSRA-TD500 | 300 | 200 | | En, Ch | line-level |
TotalText | 1255 | 300 | | | curved texts |
CTW-1500 | 1000 | 500 | | | |
COCO-Text | 43,686 | 20,000 | | | |

Method | Pretraining | Training data | Augmentation |
---|---|---|---|
PixelLink | None | IC15-train | |
SegLink | SynthText | IC15-train | |
EAST | ImageNet | IC15-train, IC13-train (229 images) | |
Text-Block FCN | ImageNet | IC15-train | Y |
FOTS | ImageNet, SynthText | MLT train/val sets, IC15-train + IC13-train | Y: i) longer sides of images are resized from 640 pixels to 2560 pixels, ii) images are rotated randomly in the range [−10°, 10°], iii) images are rescaled with a ratio from 0.8 to 1.2, iv) 640×640 random samples are cropped from the transformed images. |
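
The four FOTS augmentation steps listed in the table can be sketched as a parameter sampler. This is only a rough illustration, not the FOTS implementation: the names `AugParams` and `sample_aug_params` are hypothetical, uniform sampling is assumed for every range, the 0.8 to 1.2 ratio is assumed to apply to the height only, and the rotation is assumed to keep the canvas size. Only the standard library is used, so the actual image warping is left to whatever image library you prefer.

```python
import random
from dataclasses import dataclass

CROP = 640  # side length of the square training crop


@dataclass
class AugParams:
    scale: float         # factor mapping the longer side into [640, 2560]
    angle_deg: float     # rotation angle in degrees
    height_ratio: float  # extra rescale ratio in [0.8, 1.2]
    out_w: int           # width after resize and rescale
    out_h: int           # height after resize and rescale
    crop_x: int          # left edge of the 640x640 crop
    crop_y: int          # top edge of the 640x640 crop


def sample_aug_params(width, height, rng=None):
    """Sample one set of augmentation parameters for a width x height image."""
    if rng is None:
        rng = random.Random()

    # i) resize so the longer side lands in [640, 2560] (uniform: assumption)
    target_long = rng.uniform(640, 2560)
    scale = target_long / max(width, height)

    # ii) random rotation in [-10, 10] degrees
    angle_deg = rng.uniform(-10.0, 10.0)

    # iii) rescale ratio in [0.8, 1.2], applied to the height only (assumption)
    height_ratio = rng.uniform(0.8, 1.2)
    out_w = max(1, round(width * scale))
    out_h = max(1, round(height * scale * height_ratio))

    # iv) random 640x640 crop origin; it clamps to 0 when a side is shorter
    #     than 640, in which case the caller has to pad the image
    crop_x = rng.randint(0, max(0, out_w - CROP))
    crop_y = rng.randint(0, max(0, out_h - CROP))

    return AugParams(scale, angle_deg, height_ratio, out_w, out_h, crop_x, crop_y)
```

Calling `sample_aug_params(1280, 720, random.Random(0))` gives one reproducible parameter set; passing a seeded `random.Random` keeps the sampling deterministic across runs.
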

Method | Pretraining | Training data | Augmentation |
---|---|---|---|
PixelLink | IC15-train | TD500-train + HUST-TR400 | |
EAST | ImageNet | TD500-train + HUST-TR400 | |
Text-Block FCN | ImageNet | TD500-train | Y |
[1] | ImageNet | TD500-train + HUST-TR400 | Y |

Method | Pretraining | Training data | Augmentation |
---|---|---|---|
PixelLink | IC15-train | IC13-train, TD500-train and HUST-TR400 | Y |
FOTS | SynthText, ImageNet | MLT train set + val set, IC15-train + IC13-train | Y, same as for IC15 |
[1] | ImageNet | IC15-train + IC13-train | Y |

Method | Pretraining | Training data | Augmentation |
---|---|---|---|
FOTS | SynthText, ImageNet | MLT train set + val set | Y, same as for IC15 |
[1] | ImageNet | MLT train set + val set | Y |

Method | Pretraining | Training data | Augmentation |
---|---|---|---|
[1] | ImageNet | RCTW | Y |

Training set
Paper
[1] Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping # Multi-scale evaluation