sanskrit-lexicon / CORRECTIONS

Correction history for Cologne Sanskrit Lexicon

MW72 corrections to text Sanskrit Italics #320

Closed funderburkjim closed 7 years ago

funderburkjim commented 7 years ago

By a process similar to that of #318 for the AE dictionary, potential errors in the spelling of Roman Italic Sanskrit in MW72 have been developed.

This issue is devoted to those.

The potential corrections are examined by an interactive program. Here is the url for the first sample batch of 5 cases batch 00.

funderburkjim commented 7 years ago

Batch 00 has been completed.

Batch 01 and Batch 02 have been prepared and are ready to be analyzed.

I intend to work on these a bit at a time; maybe others can do some also 😃 .

These are all based on unusual 2-grams.

When these are finished, there will be a much larger (approx. 500) collection of 3-grams to examine.
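As a rough illustration of the "unusual 2-gram" idea, here is a minimal sketch in Python. It is not the program used to build these batches; the function names, the sample word list, and the `max_count` threshold are all assumptions made for the example. It counts every pair of adjacent characters across a word list and flags words containing a pair that is rare overall.

```python
from collections import Counter


def bigrams(word):
    """Return the adjacent character pairs (2-grams) of a word."""
    return [word[i:i + 2] for i in range(len(word) - 1)]


def rare_bigram_words(words, max_count=1):
    """Return (word, rare_bigrams) for each word that contains a 2-gram
    occurring at most max_count times in the whole list.

    max_count is a hypothetical threshold, not a value from this issue.
    """
    counts = Counter(bg for w in words for bg in bigrams(w))
    flagged = []
    for w in words:
        rare = sorted({bg for bg in bigrams(w) if counts[bg] <= max_count})
        if rare:
            flagged.append((w, rare))
    return flagged


# Hypothetical sample data: "dexa" stands in for a misprint of "deva".
sample = ["deva", "deva", "devī", "devī", "veda", "veda", "dexa"]
print(rare_bigram_words(sample))
```

A flagged word is only a candidate: a rare 2-gram may be a genuine spelling, which is why each case is still checked against the scan.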

drdhaval2785 commented 7 years ago

Life seems so much simpler now with a UI. Now non technical people can also contribute better.

gasyoun commented 7 years ago

Now non technical people can also contribute better.

If only we could get the green blurb here. I spent most of my time searching for the word on the page.

By default the PDF is fit to page.

I have to hit + six times to zoom in, so that in full screen I can see all the details of the scan.

I googled http://stackoverflow.com/questions/770473/is-it-possible-to-have-a-pdf-file-open-at-a-predefined-magnification-in-adobe-re and found that you can put parameters in the URL to control how the built-in reader will display the file.

For example, http://example.org/doc.pdf#zoom=50

will set the magnification to 50%.
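A minimal sketch of building such a link, assuming the viewer honours the `zoom` (and optionally `page`) open parameters in the URL fragment; support varies between PDF viewers, and the helper name `pdf_url_with_zoom` is made up for this example:

```python
from urllib.parse import urldefrag


def pdf_url_with_zoom(url, zoom_percent, page=None):
    """Return url with a '#zoom=...' fragment (and 'page=...' if given).

    Any fragment already present on the URL is replaced.
    """
    base, _old_fragment = urldefrag(url)
    params = []
    if page is not None:
        params.append(f"page={page}")
    params.append(f"zoom={zoom_percent}")
    return base + "#" + "&".join(params)


print(pdf_url_with_zoom("http://example.org/doc.pdf", 50))
print(pdf_url_with_zoom("http://example.org/doc.pdf", 150, page=3))
```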

funderburkjim commented 7 years ago

The green blurb is there!

image

Click as shown, and you get the same as with Greek in mw72 - a green blob near the line in question.

If you use Chrome, you will probably need one refresh for the green highlighted section to show. The magnification is already large enough, in my experience.

gasyoun commented 7 years ago

The green blurb is there!

Ah, so I had to click on the line link, not the page link, got it! Now that is great!


If you use Chrome, you will probably need one refresh for the green highlighted section to show.

None needed, it works like a charm, thanks!

The magnification is already large enough, in my experience.

Yes!

funderburkjim commented 7 years ago

@gasyoun Thanks for the feedback!

funderburkjim commented 7 years ago

One detail regarding a few cases of non-Sanskrit words. In mw72.txt there is a new markup tag '` (Non-Sanskrit Italic) which is used to distinguish italicized words which are not Sanskrit words. Here's how I marked one such case in batch 02: image

Thanks to the user who already went through batch 02!

funderburkjim commented 7 years ago

Yudhisṭhira-

This is misspelled (a typo), as the retroflex sh goes with the retroflex ṭh. I've adjusted such cases in batch 02 to make the correction 's' -> 'sh'.
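For illustration only, a small Python sketch of the 's' -> 'sh' adjustment before retroflex ṭ. This is not the code used for batch 02; the function name is hypothetical, and it assumes the word uses the precomposed letter ṭ (U+1E6D) after NFC normalization.

```python
import re
import unicodedata


def fix_s_before_retroflex_t(word):
    """Replace a plain 's' that directly precedes 'ṭ' with 'sh'.

    An existing 'sh' is left alone, because its 's' is followed by 'h',
    not by 'ṭ'.
    """
    word = unicodedata.normalize("NFC", word)
    return re.sub("s(?=\u1e6d)", "sh", word)


print(fix_s_before_retroflex_t("Yudhis\u1e6dhira"))
```

Suggestions produced this way would still need to be confirmed against the scan before being installed.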

funderburkjim commented 7 years ago

All the 2-gram cases (batch 00, 01, 02) are now completed. Ready to install.

drdhaval2785 commented 7 years ago

By 'not' you mean 'now'?

funderburkjim commented 7 years ago

'now' - right.

Corrections for batches 00,01,02 now installed. 2-gram analysis finished.

gasyoun commented 7 years ago

2-gram analysis finished.

Give us more!