githubharald / SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow.
https://towardsdatascience.com/2326a3487cd5
MIT License
1.99k stars 893 forks source link

Should the training text file contain ground truth as well? #16

Closed AdarshMJ closed 6 years ago

AdarshMJ commented 6 years ago

The words.txt file which is used for training, should it also incorporate a ground truth? Also while training does the main.py make use of the images that were generated as a result of that script used for generating dataset similar to IAM?

I am getting this error

Ground truth -> Recognized [ERR:8] "Electric" -> "" [ERR:8] "Electric" -> "" [ERR:3] "Any" -> "" [ERR:9] "Electrons" -> "" [ERR:9] "Cyclotron" -> "" [ERR:7] "Induced" -> "" [ERR:4] "Mass" -> "" [ERR:4] "Flux" -> "" [ERR:6] "Region" -> "" [ERR:1] "p" -> "" Character error rate: 100.000000%. Word accuracy: 0.000000%.

githubharald commented 6 years ago

If you are unsure what happens with the content of words txt: please refer to this method . The code is commented, and you can also put some print() functions there to see what is happening.

Regarding the output: there is nothing wrong with this output. Therefore closing this issue.