Hi, I'm new to this field, and I'm learning a lot of new stuff. But I'm quite stuck at the training data part. I've seen how the ocrd test samples with ground truth. I've extracted that data, followed all the steps, and created the trained data. But when I'm trying to create my own data set to train the tesseract with English as my preferred language, I get a lot of errors when trying to train it. I'm not quite sure where I've gone wrong. Can you please tell me how the foo data set was made? It will be really helpful.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Hi, I'm new to this field, and I'm learning a lot of new stuff. But I'm quite stuck at the training data part. I've seen how the ocrd test samples with ground truth. I've extracted that data, followed all the steps, and created the trained data. But when I'm trying to create my own data set to train the tesseract with English as my preferred language, I get a lot of errors when trying to train it. I'm not quite sure where I've gone wrong. Can you please tell me how the foo data set was made? It will be really helpful.