sanskrit-lexicon / MWS

Monier Monier-Williams, Sir; A Sanskrit-English dictionary. Oxford, 1899

panini links in mw #120

Closed by funderburkjim 2 years ago

funderburkjim commented 2 years ago

This issue reports on work done to improve the markup in MW of the literary sources for Panini. It is a continuation of work described in https://github.com/sanskrit-lexicon/csl-orig/issues/519.

The work is in two parts:

Andhrabharati commented 2 years ago

Unbelievable, @funderburkjim, to see my wish fulfilled.

I am overwhelmed with joy!

funderburkjim commented 2 years ago

normalization

panini_links_mw1.txt shows all the 8800+ Panini links before the changes.

panini_links_mw2.txt shows all the 9200+ Panini links after the changes.

change2.txt has the cumulative change transactions that were applied to lines of the mw digitization. Note: the steps leading to these changes are discussed in the section "## change2.txt constructed manually" of readme.txt.

panini_links1.txt is a different view of panini_links_mw2.txt, in which each link is grouped under whichever of 13 regular expression forms its 'ls' element matches.
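As a rough illustration of grouping links by regular expression form, here is a minimal Python sketch. The two patterns, their names, and the `Pāṇ. 1-2, 3` link shape are assumptions made for illustration only; the 13 forms actually used are not listed in this thread, and this is not the program that produced panini_links1.txt.

```python
import re
from collections import defaultdict

# Hypothetical forms: the real list of 13 forms is not given in this thread.
# The assumed link text looks like "Pāṇ. 1-2, 3" (chapter-pada, sutra).
FORMS = [
    ("chapter-pada-sutra", re.compile(r"^Pāṇ\. (\d+)-(\d+), (\d+)$")),
    ("chapter-pada-sutra-range", re.compile(r"^Pāṇ\. (\d+)-(\d+), (\d+)-(\d+)$")),
]


def classify(ls_text):
    """Return the name of the first form that matches, or "other"."""
    for name, pattern in FORMS:
        if pattern.match(ls_text):
            return name
    return "other"


def group_by_form(ls_texts):
    """Group the text of 'ls' elements by the form each one matches."""
    groups = defaultdict(list)
    for text in ls_texts:
        groups[classify(text)].append(text)
    return dict(groups)
```

Anything that falls into the "other" group is a candidate for manual review, which is roughly the role a catch-all form plays in this kind of report.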

Andhrabharati commented 2 years ago

I now urge you to take a look at the "(Capital letter) unique words extracted" file in the mws_issue_99 folder (under MWS), to find the ls entries that remain untagged.

This is a list of only about 1700 lines, and one can easily find the untagged ls entries in it.

funderburkjim commented 2 years ago

The normalization changes can also be reviewed via the csl-orig commit 39e8b33.

A small number of print changes were made, by reference to PWG and Katre's Panini edition. See the csl-corrections commit 40ff84e for details.

gasyoun commented 2 years ago

> A small number of print changes were made, by reference to PWG and Katre's Panini edition. See the csl-corrections commit 40ff84e for details.

Absolutely mesmerizing. As usual.

funderburkjim commented 2 years ago

conversion to roman numerals

All the Panini links were changed to use roman numerals for the first (chapter) parameter. change_roman.txt has the change transactions for the lines of mw.txt.

panini_links_mw3.txt lists all the Panini links after the conversion to roman numerals.
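The conversion of the chapter parameter can be sketched in Python as follows. This is only an illustration: the pre-conversion shape `Pāṇ. 1-2, 3`, the post-conversion shape `Pāṇ. i, 2, 3`, and the function name `to_roman_link` are assumptions, and the actual transactions in change_roman.txt may have been generated differently.

```python
import re

# The Ashtadhyayi has eight chapters, so only 1..8 need a roman equivalent.
ROMAN = {1: "i", 2: "ii", 3: "iii", 4: "iv", 5: "v", 6: "vi", 7: "vii", 8: "viii"}

# Assumed pre-conversion link shape: "Pāṇ. <chapter>-<pada>, <sutra>".
LINK = re.compile(r"Pāṇ\. ([1-8])-(\d+), (\d+)")


def to_roman_link(text):
    """Rewrite every assumed-form Panini link in text with a roman chapter."""
    def repl(match):
        chapter = ROMAN[int(match.group(1))]
        return f"Pāṇ. {chapter}, {match.group(2)}, {match.group(3)}"

    return LINK.sub(repl, text)
```

For example, `to_roman_link("<ls>Pāṇ. 1-2, 3</ls>")` gives `<ls>Pāṇ. i, 2, 3</ls>`, and text without a matching link is returned unchanged.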

funderburkjim commented 2 years ago

display links with roman numerals

For the links to target URLs such as https://ashtadhyayi.com/sutraani/1/2/3/ to work in the MW displays, the PHP display program had to be changed, specifically basicadjust.php. One copy of the revised file is in the csl-apidev repository (for simple search, etc.), and another is in the csl-websanlexicon repository (for basic display, etc.). See the links to the commits of these repositories above to review the changes to the basicadjust module.
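The mapping that the display code needs, from a link with a roman chapter to a numeric URL, can be sketched as below. This is a Python illustration under stated assumptions, not the code in basicadjust.php: the link shape `Pāṇ. i, 2, 3` and the function name `sutra_url` are hypothetical, and only the URL pattern is taken from the comment above.

```python
import re

BASE = "https://ashtadhyayi.com/sutraani"

# Roman chapter numerals i..viii mapped back to the integers the URL expects.
ARABIC = {"i": 1, "ii": 2, "iii": 3, "iv": 4, "v": 5, "vi": 6, "vii": 7, "viii": 8}

# Assumed link shape after the roman-numeral conversion: "Pāṇ. i, 2, 3".
LINK = re.compile(r"^Pāṇ\. (i{1,3}|iv|vi{0,3}), (\d+), (\d+)")


def sutra_url(ls_text):
    """Return the sutra URL for an assumed-form link, or None if no match."""
    match = LINK.match(ls_text)
    if match is None:
        return None
    chapter = ARABIC[match.group(1)]
    pada = int(match.group(2))
    sutra = int(match.group(3))
    return f"{BASE}/{chapter}/{pada}/{sutra}/"
```

Returning `None` for anything that does not match leaves it to the caller to render such an 'ls' element as plain text rather than as a broken link.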

funderburkjim commented 2 years ago

@Andhrabharati

I will open a new issue to look at Capital-letter.., with the aim of finding further untagged entries in mw.