FudanOCR
This toolbox contains the implementations of the following papers:
- EAFormer: Scene Text Segmentation with Edge-Aware Transformers [Yu et al., ECCV-24]
- Scene Text Segmentation with Text-Focused Transformers [Yu et al., ACM MM-23]
- Weakly-Supervised Text Instance Segmentation [Zu et al., ACM MM-23]
- Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning [Yu et al., ICCV-23 (Oral)]
- Orientation-Independent Chinese Text Recognition in Scene Images [Yu et al., IJCAI-23]
- Towards Accurate Video Text Spotting with Text-wise Semantic Reasoning [Zu et al., IJCAI-23]
- Chinese Character Recognition with Augmented Character Profile Matching [Zu et al., ACM MM-22]
- Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution [Chen et al., AAAI-22]
- Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition [Chen et al., IJCAI-21]
- Scene Text Telescope: Text-Focused Scene Image Super-Resolution [Chen et al., CVPR-21]
The README.md
file in each folder contains the instruction about how to run the code