Open funderburkjim opened 8 years ago
In https://github.com/funderburkjim/MWderivations/blob/master/step4/analysis2.py @funderburkjim
known_prefixes = [
'a', 'an',
'aBi', 'aBy',
'anu', 'anv',
'ava', 'apa', 'apA',
'aDi','aDy',
'A',
'upa',
'ut','ud','ul','uc','uj','un',
'ni','ny',
'nir','nih',
'pari', 'pary',
'pra', 'prati',
'mahA'
'vi','vy',
'sam', 'saM',
'su', 'sv',
'sa',
'ati','aty',
'antar',
'sam','saM',
]
In 2014 we had some fun with Upasargārthacandrika
(https://docs.google.com/document/d/1VjWwbmSRztdNtBvnm_CnRhvGLOf8IXA4-pQv4ezkr0A/edit#). There are 22 upasargas. प्र, परा, अप, सम्, अनु, अव, निस्, निर्, दुस्, दुर्, वि,आ (आङ्), नि, अधि, अपि, अति, सु, उत् /उद्, अभि, प्रति, परि, उप। Not all upasargas take all dhatus. Dhaval even printed a book उपसर्गमण्डनम् on the prefixes. In 2015 Dhaval wrote "I have a readymade list, made out of the upasargArthasiddhAntacandrikA":
$upasarga_combinations = array("ati,atinis,atipra,ativi,ativyA,atisam,atyati,atyaBi,atyA,atyud,atyupa,aDi,aDini,aDinis,aDivi,aDyava,aDyA,aDyupa,anu,anuni,anunis,anuparA,anupari,anuparyA,anupra,anuprati,anuvi,anuvyava,anuvyA,anusam,anusampra,anUd,anvapa,anvava,anvA,apa,apani,apanis,apaparA,apaparyA,apapra,apavyA,apA,apAti,api,apipari,apod,apyati,aBi,aBini,aBinis,aBiparA,aBipari,aBiparyA,aBipra,aBivi,aBivyA,aBisamA,aBisam,aByati,aByaDi,aByanu,aByapa,aByava,aByA,aByudA,aByud,aByupa,aByupA,aByupAva,ava,avani,avA,A,utpra,udava,udA,ud,udvi,unni,upa,upani,upanis,upanyA,upapari,upaparyA,upapra,upavi,upavyA,upasaṁni,upasamA,upasam,upA,upAti,upAva,upodA,upod,upopa,duHsam,duranu,durava,durA,durud,durupa,durni,duzpari,duzpra,dus,ni,nipra,nirati,niraDi,niranu,nirapa,niraBi,niraBi,nirava,nirupA,nirvi,nivyA,nizpra,nisu,nis,nyA,parA,pari,parini,parinis,paripra,parivi,parivyA,parisam,paryaDi,paryanu,paryava,paryA,paryud,paryupa,pra,praNi,prati,pratini,pratinis,pratiparA,pratipari,pratipra,prativi,prativyA,pratisam,pratyaDi,pratyanu,pratyapa,pratyapi,pratyaBi,pratyava,pratyA,pratyudA,pratyud,pratyupa,pratyupA,pravi,pravyA,prasam,prA,prADi,prod,vi,vini,vinis,viparA,vipari,viparyA,vipra,viprati,visam,vyati,vyanu,vyanvA,vyapa,vyapA,vyaBi,vyava,vyA,vyud,vyupa,saṁvi,saṁvyava,saṁvyA,sanni,samati,samaDi,samanu,samanuvi,samanvA,samapa,samapi,samaBi,samaBivyA,samaBisam,samaBisampra,samaByava,samaByA,samaByud,samava,samava,samavA,samA,samudA,samud,samupa,samupA,sam,samparA,sampari,sampra,samprati,samprA,samprod,sampvari,su,supari,suvi,susamA,svanu,svaBi,svaByA,");
@funderburkjim does the additional list makes sense to use?
mwverbs seems related.
This document has both roots and prefixed roots in MW.
And, the prefixed roots are 'parsed', in the sense that the prefix is analyzed.
For example: atinizwan:3045:pre::ati+nis+tan
shows that 'ati+nis' occurs as the
prefix for the prefixed verb 'atinizwan'.
From this file, you could get all X for lines ending in ':X+Y'
You might weed out those values of X which are not composed of upasargas,
such as in aMSI+kf
.
This
In general, prefixes take various forms due to sandhi requirements. The analysis2 program handles this by presenting the variants in the list
known_prefixes
.The variants
uc, uj, and un
have been added to the variants ofud
, in addtion to the previously added variantsut and ul
.In addtion, when
ud
is prefixed to a nominal starting with the palatal sibilantS
, not only doesud
becomeuc
, but alsoS
becomesC
. This identification was implemented within thepfx1
method (functionanalysis2_pfx1
).These added 74 additional derivations.