Closed drdhaval2785 closed 7 years ago
Here is the digitzation for kuwwa, from ap90_orig_utf8.txt:
<P>.{#ku¤TTa#}¦ {%a.%} (At the end of comp.) Di-
<>viding, cutting; grinding. {#{@--¤TTaH@}#} (in
<>Math.) A multiplier.
Note the presence of that special character in the 2nd line also.
0406-c: 'kujJawiH, kujJawikA, kujJawI' => 'kujJawiH' :69334,69335 0406-c: 'kuMjaH, --jaM' => 'kuMjaH' :69348,69358 0407-a: 'kuwika --ta' => 'kuwika' :69384,69384 0407-a: 'kuwaH, --waM' => 'kuwaH' :69385,69393 0407-b: 'kuwIraH, --raM, kuwIrakaH' => 'kuwIraH' :69448,69450 0407-b: 'kuwuMbaM, kuwuMbakaM' => 'kuwuMbaM' :69458,69473 0407-c: 'kuwuMbikaH, kuwuMbin' => 'kuwuMbikaH' :69474,69489 0407-c: 'ku¤wwa' => 'kuwwa' :69494,69496 0407-c: 'ku¤wwakaH' => 'kuwwakaH' :69497,69497 0407-c: 'ku¤wwanaM' => 'kuwwanaM' :69498,69499
There are all in all 70 lines of the ap90.txt digitization where that ¤
character appears.
I looked at the scan for a couple of the others, and in those cases also that character appears to be a typo.
Thus, I think it is safe to consider all occurrences of that character to be typos, and will remove all of them from the digitization.
kuwwa, I see nothing that would warrant a special character between 'u' and 'w'.
so do I.
Thus, I think it is safe to consider all occurrences of that character to be typos, and will remove all of them from the digitization.
seems fine, no meta data involved.
corrections installed.
While running the code, program stopped on some non-ASCII entry.