funderburkjim / MWderivations

Derivations of headwords in the Monier-Williams (1899) dictionary

Variations in prefix 'ud' #4

Open funderburkjim opened 8 years ago

funderburkjim commented 8 years ago

In general, prefixes take various forms due to sandhi requirements. The analysis2 program handles this by presenting the variants in the list known_prefixes.

The variants uc, uj, and un have been added to the variants of ud, in addition to the previously added variants ut and ul.

In addition, when ud is prefixed to a nominal starting with the palatal sibilant S, not only does ud become uc, but the initial S also becomes C. This identification was implemented within the pfx1 method (function analysis2_pfx1).

These changes added 74 derivations.
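
The variant handling described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual code of analysis2_pfx1: the names UD_VARIANTS and ud_base_candidates are hypothetical, and the example word ucCvAsa (ud + SvAsa, in SLP1) is chosen here for illustration.

```python
# Hypothetical sketch; not the real analysis2_pfx1 implementation.
# Sandhi variants of the prefix 'ud' named in this issue (SLP1 transliteration).
UD_VARIANTS = ('ud', 'ut', 'ul', 'uc', 'uj', 'un')

def ud_base_candidates(headword):
    """Return candidate bases left after stripping a variant of 'ud'."""
    candidates = []
    for variant in UD_VARIANTS:
        if headword.startswith(variant) and len(headword) > len(variant):
            rest = headword[len(variant):]
            candidates.append(rest)
            # ud + S... surfaces as uc + C..., so also try restoring the S.
            if variant == 'uc' and rest.startswith('C'):
                candidates.append('S' + rest[1:])
    return candidates

print(ud_base_candidates('ucCvAsa'))
```

Each candidate would then still have to be checked against the list of known headwords; the sketch only generates the possible bases.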

gasyoun commented 3 years ago

In https://github.com/funderburkjim/MWderivations/blob/master/step4/analysis2.py @funderburkjim

known_prefixes = [
 'a', 'an',  
 'aBi', 'aBy',
 'anu', 'anv',
 'ava', 'apa', 'apA',
 'aDi','aDy',
 'A',
 'upa', 
 'ut','ud','ul','uc','uj','un',
 'ni','ny',
 'nir','nih',
 'pari', 'pary',
 'pra', 'prati',
 'mahA',
 'vi','vy',
 'sam', 'saM', 
 'su', 'sv',
 'sa',
 'ati','aty',
 'antar',
 'sam','saM',
]

In 2014 we had some fun with Upasargārthacandrika (https://docs.google.com/document/d/1VjWwbmSRztdNtBvnm_CnRhvGLOf8IXA4-pQv4ezkr0A/edit#). There are 22 upasargas: प्र, परा, अप, सम्‌, अनु, अव, निस्‌, निर्‌, दुस्‌, दुर्‌, वि, आ (आङ्‌), नि, अधि, अपि, अति, सु, उत्/उद्‌, अभि, प्रति, परि, उप। Not all upasargas take all dhatus. Dhaval even printed a book उपसर्गमण्डनम् on the prefixes. In 2015 Dhaval wrote "I have a readymade list, made out of the upasargArthasiddhAntacandrikA":

$upasarga_combinations = array("ati,atinis,atipra,ativi,ativyA,atisam,atyati,atyaBi,atyA,atyud,atyupa,aDi,aDini,aDinis,aDivi,aDyava,aDyA,aDyupa,anu,anuni,anunis,anuparA,anupari,anuparyA,anupra,anuprati,anuvi,anuvyava,anuvyA,anusam,anusampra,anUd,anvapa,anvava,anvA,apa,apani,apanis,apaparA,apaparyA,apapra,apavyA,apA,apAti,api,apipari,apod,apyati,aBi,aBini,aBinis,aBiparA,aBipari,aBiparyA,aBipra,aBivi,aBivyA,aBisamA,aBisam,aByati,aByaDi,aByanu,aByapa,aByava,aByA,aByudA,aByud,aByupa,aByupA,aByupAva,ava,avani,avA,A,utpra,udava,udA,ud,udvi,unni,upa,upani,upanis,upanyA,upapari,upaparyA,upapra,upavi,upavyA,upasaṁni,upasamA,upasam,upA,upAti,upAva,upodA,upod,upopa,duHsam,duranu,durava,durA,durud,durupa,durni,duzpari,duzpra,dus,ni,nipra,nirati,niraDi,niranu,nirapa,niraBi,niraBi,nirava,nirupA,nirvi,nivyA,nizpra,nisu,nis,nyA,parA,pari,parini,parinis,paripra,parivi,parivyA,parisam,paryaDi,paryanu,paryava,paryA,paryud,paryupa,pra,praNi,prati,pratini,pratinis,pratiparA,pratipari,pratipra,prativi,prativyA,pratisam,pratyaDi,pratyanu,pratyapa,pratyapi,pratyaBi,pratyava,pratyA,pratyudA,pratyud,pratyupa,pratyupA,pravi,pravyA,prasam,prA,prADi,prod,vi,vini,vinis,viparA,vipari,viparyA,vipra,viprati,visam,vyati,vyanu,vyanvA,vyapa,vyapA,vyaBi,vyava,vyA,vyud,vyupa,saṁvi,saṁvyava,saṁvyA,sanni,samati,samaDi,samanu,samanuvi,samanvA,samapa,samapi,samaBi,samaBivyA,samaBisam,samaBisampra,samaByava,samaByA,samaByud,samava,samava,samavA,samA,samudA,samud,samupa,samupA,sam,samparA,sampari,sampra,samprati,samprA,samprod,sampvari,su,supari,suvi,susamA,svanu,svaBi,svaByA,");

gasyoun commented 3 years ago

@funderburkjim does it make sense to use the additional list?

funderburkjim commented 3 years ago

mwverbs seems related.

This document has both roots and prefixed roots in MW, and the prefixed roots are 'parsed' in the sense that the prefix is analyzed. For example, atinizwan:3045:pre::ati+nis+tan shows that 'ati+nis' occurs as the prefix for the prefixed verb 'atinizwan'.

From this file, you could get all X for lines ending in ':X+Y'.

You might weed out those values of X which are not composed of upasargas, such as in aMSI+kf.
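
A minimal sketch of that extraction, assuming the parse is always the last colon-separated field and that its last '+' component is the root. The function names are hypothetical, the upasarga set is transcribed into SLP1 from the list of 22 quoted earlier in this thread, and the second sample line is made up (with a placeholder number) to show the filtering.

```python
# The 22 upasargas in SLP1 (ut/ud counted as one), transcribed from the list above.
UPASARGAS = {
    'pra', 'parA', 'apa', 'sam', 'anu', 'ava', 'nis', 'nir', 'dus', 'dur',
    'vi', 'A', 'ni', 'aDi', 'api', 'ati', 'su', 'ut', 'ud', 'aBi',
    'prati', 'pari', 'upa',
}

def prefix_parts(line):
    """Return the prefix components of a line ending in ':X+Y', else []."""
    parse = line.strip().rsplit(':', 1)[-1]   # text after the last colon
    parts = parse.split('+')
    return parts[:-1]                         # drop the final component (the root)

def upasarga_prefixes(lines):
    """Collect distinct prefixes whose components are all upasargas."""
    found = set()
    for line in lines:
        parts = prefix_parts(line)
        if parts and all(p in UPASARGAS for p in parts):
            found.add('+'.join(parts))
    return sorted(found)

sample = [
    'atinizwan:3045:pre::ati+nis+tan',  # example quoted in this thread
    'aMSIkf:0:pre::aMSI+kf',            # made-up line; 0 is a placeholder
]
print(upasarga_prefixes(sample))
```

Prefixes written in a sandhi form (for example uc or ny) would not match this set as written, so a real run would probably need the variant forms added or normalized first.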

This