clovaai / CLEval

CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
MIT License
185 stars 28 forks source link

Happy to see fair evaluation metric being developed #2

Closed rkcosmos closed 4 years ago

rkcosmos commented 4 years ago

Hello Clova team,

Greeting from EasyOCR, I'm looking forward to have EasyOCR evaluated by fair metric. (I'm a bit scared of course but it's always better to know own's weakness than living in illusion) May I suggest you to add support for datasets in other language? I think UnrealText has the potential. (I have not tested it myself though.)

Thanks, Rakpong

dev-strender commented 4 years ago

Oh, hi @rkcosmos !

First of all, thank you for your interest on our evaluation. And also thanks for your contribution on OCR community with maintaining EasyOCR, an amazing OCR inference open source code utilizing our works.

As we mentioned on README, we did not test on multi-lingual dataset since there exists more things to consider, like

You recommended the UnrealText for test set to examine, and yes, we are planning to target on MLT19(https://rrc.cvc.uab.es/?ch=15), which has the same kinds of languages.

We hope to develop it as soon as possible, so stay tuned! OR, any pull requests are welcome :)

rkcosmos commented 4 years ago

Hi, yes we use a lot of your work. Before launching our project, we looked into quite a number of repositories and found that your team produces easy-to-read + really work as advertised + under free license code. We really appreciate your team's effort in open sourcing their work.

It's good to know you are targeting MLT19. From your TODO on readme, you mention trying to calculate length of text. Let's say in Hindi you can have र+ ्+ क+ ि = र्कि. Do you want length = 4 or length = 1? If you want length = 4, then you can just use len(). If you want length = 1, then you can render them and calculate the width of image. I can even try to create a function for that. (would like to find a way to contribute back to your team :) )