Topdu / OpenOCR

Apache License 2.0
181 stars 19 forks source link

OpenOCR

OpenOCR aims to establish a unified training and evaluation benchmark for scene text detection and recognition algorithms, at the same time, serves as the official code repository for the OCR team from the FVL Laboratory, Fudan University.

We are actively developing and refining it and expect to release the first version as soon as possible.

We sincerely welcome the researcher to recommend OCR or relevant algorithms and point out any potential factual errors or bugs. Upon receiving the suggestions, we will promptly evaluate and critically reproduce them. We look forward to collaborating with you to advance the development of OpenOCR and continuously contribute to the OCR community!

Ours STR algorithms

STR

Reproduction schedule:

Method Venue Training Evaluation Contributor
CRNN TPAMI 2016
ASTER TPAMI 2019 pretto0
NRTR ICDAR 2019
SAR AAAI 2019 pretto0
MORAN PR 2019 Debug
DAN AAAI 2020
RobustScanner ECCV 2020 pretto0
AutoSTR ECCV 2020
SRN CVPR 2020 pretto0
SEED CVPR 2020
ABINet CVPR 2021 YesianRohn
VisionLAN ICCV 2021 YesianRohn
SVTR IJCAI 2022
PARSeq ECCV 2022
MATRN ECCV 2022
MGP-STR ECCV 2022
CPPD 2023
LPV IJCAI 2023
MAERec(Union14M) ICCV 2023
LISTER ICCV 2023
CDistNet IJCV 2024 YesianRohn
BUSNet AAAI 2024
DCTC AAAI 2024 TODO
CAM PR 2024
OTE CVPR 2024
CFF IJCAI 2024 TODO
DPTR ACM MM 2024 TODO
VIPTR ACM CIKM 2024 TODO
IGTR 2024
SMTR 2024
FocalSVTR-CTC 2024
SVTRv2 2024
ResNet+Trans-CTC
ViT-CTC

Contributors


Yiming Lei (pretto0) and Xingsong Ye (YesianRohn) from the FVL Laboratory, Fudan University, under the guidance of Professor Zhineng Chen, completed the majority of the algorithm reproduction work. Grateful for their outstanding contributions.


STD

E2E

Acknowledgement

This codebase is built based on the PaddleOCR, PytorchOCR, and MMOCR. Thanks for their awesome work!