open-mmlab / mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox
https://mmocr.readthedocs.io/en/dev-1.x/
Apache License 2.0

[Attention] Be an early contributor to MMOCR 1.0! #1493

Open gaotongxiao opened 1 year ago

gaotongxiao commented 1 year ago

In Sep 2022, we released the first release candidate of MMOCR 1.0, bringing tons of new features to the community. MMOCR 1.0 makes many significant changes in the hope of addressing critical complaints and requirements from thousands of users, but the actual implementation has never been sufficiently reviewed by our community. Some changes might be improvements, and some might be bad.

That's why we have an RC period: we hope to broadly collect advice and suggestions from our community and continuously adopt ideas and implementations to improve this beta version before we finalize it into a robust one. We call on everyone to be early users, and even developers, of our latest version. Here are some aspects for everyone to get involved in and improve together. You can claim a task by replying to this issue. Your feedback is greatly appreciated :)

Documentation

Features

If you want to be a contributor, here are some simple starting points:

Some amazing features are worth time exploring:

Visualization

Currently, browse_dataset.py is able to visualize the image and annotation processed through data transforms, but sometimes:

these features are missing from the scripts and the implementation of them would be a great plus to research.

Optimization of HmeanIoUMetric (Medium)

As reported in this issue, HmeanIoUMetric is still not efficient enough when the numbers of gt and pred polygons are large. It might be sped up with

There might be better solutions out there that have not yet been discovered, so this can be left as an open-ended question to be improved step by step.
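
For anyone picking this task up, here is a rough sketch of why the cost grows with the number of polygons. It is not MMOCR's actual HmeanIoUMetric code: it assumes axis-aligned boxes `(x1, y1, x2, y2)` as stand-ins for polygons, a greedy one-to-one matching, and an IoU threshold of 0.5, and the names `box_iou` and `hmean_iou` are made up for illustration. The point is only that a pairwise IoU table needs one overlap computation per gt/pred pair.

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes (x1, y1, x2, y2).

    Assumption: boxes stand in for polygons to keep the sketch
    dependency-free; real polygon IoU is much more expensive.
    """
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def hmean_iou(gts, preds, iou_thr=0.5):
    """Return (precision, recall, hmean) for one image.

    Hypothetical helper: greedy matching, not necessarily the
    matching strategy used by the library.
    """
    # Pairwise table: len(gts) * len(preds) IoU evaluations.
    iou = [[box_iou(g, p) for p in preds] for g in gts]

    matched_preds = set()
    matches = 0
    for gi in range(len(gts)):
        for pi in range(len(preds)):
            # Each prediction may be matched to at most one gt.
            if pi not in matched_preds and iou[gi][pi] >= iou_thr:
                matched_preds.add(pi)
                matches += 1
                break

    recall = matches / len(gts) if gts else 0.0
    precision = matches / len(preds) if preds else 0.0
    denom = precision + recall
    hmean = 2 * precision * recall / denom if denom > 0 else 0.0
    return precision, recall, hmean


if __name__ == "__main__":
    gts = [(0, 0, 10, 10), (20, 20, 30, 30)]
    preds = [(0, 0, 10, 10), (100, 100, 110, 110), (21, 21, 30, 30)]
    print(hmean_iou(gts, preds))
```

With N gt and M pred instances, the table above costs N × M overlap computations per image, so any approach that avoids computing IoU for pairs that clearly cannot overlap reduces the work directly.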

xinke-wang commented 1 year ago

I've raised PR #1503 to add support for visualizing raw images in browse_dataset.py.