drdhaval2785 / SanskritSpellCheck

spell checking based on patterns
1 stars 1 forks source link

Methodology of this code #6

Closed drdhaval2785 closed 9 years ago

drdhaval2785 commented 10 years ago

Let me document the method I have adopted to find the suspected wrong entries in https://github.com/sanskrit-lexicon/CORRECTIONS/issues/2. Code for checking is attached here. Google doc for logic behind approach and instructions for dictionary /testers. Video tutorial for code running - http://youtu.be/qLqYUZUGM6M Video tutorial for noting issues on this correction forum - https://www.youtube.com/watch?v=rKZ_OsSHwsY

In short - the logic is as follows

  1. We check the base dictionary for different Vowel and consonant patterns like VV, VCV, VCCV, VCCCV etc. (We presume that the base dictionary has correct entries - which need not be true).
  2. We check the dictionary to be tested for different vowel and consonant patterns like 1 above.
  3. If the pattern is not found in 1, but found in 2 - we note that in suspectfalse.html.
  4. We check those entries from dictionary scans and verify.
  5. If wrong word is detected, it is intimated to the webmaster here.