Open drdhaval2785 opened 8 years ago
e.g. 04.0001, 04.0002, 04.0004, 04.0005 etc would mean that our data misses out on 04.0003.
Similarly if there are entries like 05.0060, 05.0061, 05.0131 - It would mean that our member 05.0131 is odd at least, as the last entry ended at 61.
Can it be the case? Hardly I believe there are such instances.
While working for verb form generation, I came across some entries where the verb number was a bit wierd e.g. a specific 'gaNa' has < 100 entries and the verb number was > 200. As you are already doing the analysis of verbdata errors, I propose the following modality also.
For every 'gaNa' we analyse for verb numbers and if there are any missing numbers or too high number with no entries in between, it mostly is a missed entry / error respectively.
e.g. 04.0001, 04.0002, 04.0004, 04.0005 etc would mean that our data misses out on 04.0003. Similarly if there are entries like 05.0060, 05.0061, 05.0131 - It would mean that our member 05.0131 is odd at least, as the last entry ended at 61.
This will surely lead us to much more corrections.