FudanVI / FudanOCR

A toolbox of scene text super-resolution and recognition

367 stars 62 forks source link

readme

FudanOCR

This toolbox contains the implementations of the following papers:

EAFormer: Scene Text Segmentation with Edge-Aware Transformers [Yu et al., ECCV-24]
Scene Text Segmentation with Text-Focused Transformers [Yu et al., ACM MM-23]
Weakly-Supervised Text Instance Segmentation [Zu et al., ACM MM-23]
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning [Yu et al., ICCV-23 (Oral)]
Orientation-Independent Chinese Text Recognition in Scene Images [Yu et al., IJCAI-23]
Towards Accurate Video Text Spotting with Text-wise Semantic Reasoning [Zu et al., IJCAI-23]
Chinese Character Recognition with Augmented Character Profile Matching [Zu et al., ACM MM-22]
Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution [Chen et al., AAAI-22]
Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition [Chen et al., IJCAI-21]
Scene Text Telescope: Text-Focused Scene Image Super-Resolution [Chen et al., CVPR-21]

The README.md file in each folder contains the instruction about how to run the code