However, this has the following issues that needs to be fixed:
If the source language is not japanese, then the pytesseract_config is not valid.
If the source language is not part of vertical language, we aren't even extracting text from it
Furthermore, the imagetranslation.py file is too big since it contains too much unnecessary logics that can be subdivided into other modules. Therefore, the pytesseract handling should be exported into a seperate file called textextraction.py inside ./src/modules. The file should implement the main method
Currently we process the image using pytesseract right inside the
imagetranslation.py
file.However, this has the following issues that needs to be fixed:
japanese
, then thepytesseract_config
is not valid.Furthermore, the
imagetranslation.py
file is too big since it contains too much unnecessary logics that can be subdivided into other modules. Therefore, the pytesseract handling should be exported into a seperate file calledtextextraction.py
inside./src/modules
. The file should implement the main methodsuch that the
imagetranslation.py
file now will be call it as such