Unfortunately, there's no easy answer to your question.
This project is pretty complicated, especially the data preparation and pre-processing part. For data preparation I borrowed most of the code from other popular repos, e.g. Pay20Y. The data prep code is stored in synthtext.py and icdar.py. Unfortunately, it is not easy to read or digest. It took me a couple of full days to read through it and modify it for my needs.
As of now, I'm not aware of any good example of well-written, readable data prep code for FOTS, and sadly this repo is no exception (the data prep code should ideally be completely refactored).
To go back to your question, my advice would be to carefully investigate what the model's training input looks like and how it is used. Then move on from there.
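One way to start that investigation is to print the structure of a single training sample or batch before reading the data prep code itself. The helper below is only a rough sketch: `describe` is a hypothetical function, not part of this repo, and the stand-in sample at the bottom is made up, since the real structure returned by the datasets in synthtext.py and icdar.py is exactly what you would be trying to find out.

```python
def describe(obj, name="sample", depth=0):
    """Return a list of lines summarising the nested structure of obj.

    Hypothetical debugging helper, not part of the repo.
    """
    pad = "  " * depth
    # Arrays and tensors (NumPy, PyTorch, ...) expose .shape and .dtype.
    if hasattr(obj, "shape") and hasattr(obj, "dtype"):
        return [f"{pad}{name}: {type(obj).__name__} "
                f"shape={tuple(obj.shape)} dtype={obj.dtype}"]
    if isinstance(obj, dict):
        lines = [f"{pad}{name}: dict with {len(obj)} keys"]
        for key, value in obj.items():
            lines.extend(describe(value, str(key), depth + 1))
        return lines
    if isinstance(obj, (list, tuple)):
        lines = [f"{pad}{name}: {type(obj).__name__} of length {len(obj)}"]
        # Show only the first three items to keep the output readable.
        for i, value in enumerate(obj[:3]):
            lines.extend(describe(value, f"[{i}]", depth + 1))
        if len(obj) > 3:
            lines.append(f"{pad}  ... ({len(obj) - 3} more)")
        return lines
    # Anything else: print the type and a shortened repr.
    text = repr(obj)
    if len(text) > 60:
        text = text[:57] + "..."
    return [f"{pad}{name}: {type(obj).__name__} = {text}"]


if __name__ == "__main__":
    # Made-up stand-in; replace it with a real sample, for example
    # dataset[0] or next(iter(data_loader)) from your training script.
    sample = ("img_1.jpg", [[0, 0, 10, 0, 10, 5, 0, 5]], ["hello"])
    print("\n".join(describe(sample)))
```

Running it on one item from the dataset and on one batch from the data loader shows which shapes and types the model expects, and where batching changes them.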