sanskrit-lexicon / MWS

Monier Monier-Williams, Sir; A Sanskrit-English dictionary. Oxford, 1899
Other
7 stars 5 forks source link

MWS accent correction, continue, phase 4 #145

Closed funderburkjim closed 1 year ago

funderburkjim commented 1 year ago

Further review of accents in MWS., based on the version of MW at sanskrit-lexicon/MWS#142; Namely, version of mw.txt in sanskrit-lexicon/csl-orig repository at v02/mw/mw.txt at commit 360db2b.

Andhrabharati commented 1 year ago

I'm not sure how to get your 460.

My first thought was that by 'comma between' you were talking about commas in the 'k2' field of metalines, which occurs when a word shows two accent patterns. I include the 'multiple accent patterns' comment below for possible future reference.

As I continued marking the MW data in my intended way [if not used at CDSL, useful for someone else in future; most probably at our own site, wherein we did not pursue updating the Skt. Dictionaries for past 6 years!!], (accidentally) came across <L>40710 and <L>40711 which differ just in accent position, and I got reminded of this above remark by @funderburkjim !!

Though these two are not marked as a OR-group, they do exist as different entries. And I am sure, the MW data is of such a vast size that there could be more such instances in it.

Just wanted to bring this info to Jim's notice, whether he agrees to make all such 'eligible' words to be separate entries or not.

But then I realized that you were likely talking about cases that could be considered as alternate headwords which have been 'missed' -- and any such might very well require new entries.

And I am seeing that too many (running into couple of thousands) HWs-- apart from the above mentioned 460 cases, are 'missed' that could be made as separate entries; some as grouped (OR and AND) and some of other type.

gasyoun commented 1 year ago

Saturday better than Sunday

Hard for me, as many Sanskrit classes are there. Evening Sunday is bad for Dhaval. Saturday at best I have an hour in between, like 7 or 8th of January.

And I am seeing that too many (running into couple of thousands) HWs-- apart from the above mentioned 460 cases, are 'missed' that could be made as separate entries; some as grouped (OR and AND) and some of other type.

I am your fan.

funderburkjim commented 1 year ago

@gasyoun choose a day and time for your side of the world. I'll probably be able to join the meeting.

funderburkjim commented 1 year ago

@gasyoun AFAIK, as of this moment, NO meeting day/time is set.

funderburkjim commented 1 year ago

oma/n and o/man


CURRENT
<L>40710<pc>236,1<k1>oman<k2>oma/n<e>2
<s>oma/n</s> ¦ <s>A</s>, <lex>m.</lex> help, protection, favour, kindness, <ls>RV.</ls><info lex="m"/>
<LEND>
<L>40711<pc>236,1<k1>oman<k2>o/man<e>2B
<s>o/man</s> ¦ (<s>o/man</s>, <s>A</s>), <lex>m.</lex> a friend, helper, protector, <ls>RV. v, 43, 13.</ls><info lex="m"/>
<LEND>

BETTER -- 40711 should be '2A', not '2B'

40711236,1omano/man2A Reason: same gender as 40710. It is just 'coincidental' that 40710 and 40711 differ in accent. ``` ## missed alt headwords I think of there being several 'kinds' of alternate headwords identifiable in MW * DAtu - two spellings of same root * Example vrad or vrand * substantive or indeclineable. Author notes two words referring to same object * `pAdukA—kAra or pAdukA—kft, ¦ m. a shoemaker,` * `a-kutra or (Ved.) a-ku/trA` * 'see' groups. text refers reader to another part of the dictionary for several words * `Bra-kuMSa or °sa, ¦ Bra-kuYca, Bra-kuwi &c. See under BrU, p. 771, col. 1` * lists of works * navonavavyAKyA and navOcityavicAracarcA Currently not marked as 'and' -- this seems right * The long list of works under 'navya'. Currently marked as 'or' group, but should not be so marked. I see no good reason to adhere to the MW printed text in these last two cases, Better to follow model of navonavavyAKyA and navOcityavicAracarcA (i.e., unpack the lists into separate unrelated entries). I am sure there is currently inconsistency in mw.txt especially in the last two ('see' and 'list of works') cases.