This PR improves the quality of the training yoruba data by adding some properly marked text and also removing some of the badly marked text which led to misrecognized characters.
One question I have is how frequently we plan to update the trainingdata with new data?
This PR improves the quality of the training yoruba data by adding some properly marked text and also removing some of the badly marked text which led to misrecognized characters.
One question I have is how frequently we plan to update the trainingdata with new data?
@Shreeshrii @zdenop