Open funderburkjim opened 4 years ago
The gra_preverb1 report is organized according to the GRA entries identified as verbs; each such entry is considered a 'case'. There are 907 cases corresponding to the 907 GRA root entries.
;; Case 0006: L=132, k1=aNg, k2=aNg, code=V, #upasargas=0, mw=aNg (same)
;; Case 0007: L=144, k1=ac, k2=ac, code=V, #upasargas=2 (1/1), mw=ac (same)
01 apa ac apAc apAc yes apa+ac
02 sam ac samac samac no
This record provides
mw=?
) where no correspondence currently identified.When upasargas are recognized within a verb entry, there are additional lines to the report. There is an extra line for each upasarga.
;; Case 0007: L=144, k1=ac, k2=ac, code=V, #upasargas=2 (1/1), mw=ac (same)
01 apa ac apAc apAc yes apa+ac
02 sam ac samac samac no
In each of these lines, there is
apAc
)apa+ac
)samac
);; Case 0020: L=954, k1=ar, k2=ar_f, code=V, #upasargas=14 (13/1), mw=f (diff)
01 anu ar anvar anvf yes anu+f
Note:
anvar
when using the GRA root spelling 'ar'anvf
when using the MW root spelling 'f'"¦ [mfna][.][, ]", # 2792 remain
"¦ adv[.][, ]", # 2783 remain
"¦, [mfna][.][, ]", # 2296 remain
"-.*?¦", # 2013 remain (hyphenated words
"[aá][,.})@]+¦", # 1482 remain
"tás[,.@})]+¦", # 1469 remain
"vát[,.@})]+¦", # 1433 remain
"ṣas[,.@})]+¦", # 1431 remain
"śás[,.@})]+¦", # 1420 remain
"¦ *[0-9]+[)] [mfna][.][, ]", # 1412 remain
"Ablativ", # 1407 remain
"ā́t[,.@})]+¦", # 1398 remain
"tas[,.@})]+¦", # 1388 remain
"á.[,.@})]+¦", # 1307 remain penultimate accented 'a'.
"āt[,.@})]+¦", # 1298 remain
"@[})]+¦[, ]*{@[^@]+@} *[mfna][.]", # 1212 remain more substantives
"[áa]thā.*¦", # 1199 remain
"é.*¦", # 1189 remain
"[áíú].*¦", # 1051 remain accented vowel in headword
"fem[.]", # 1036 remain
"ā́.*¦.*Instr.", # 1029 remain
"a[^@]+ā́[,.@})]+¦", # 1001 remain
"īm[,.@})]+¦", # 995
"vat[,.@})]+¦",
Examination of scanned entries suggests that upasargas appear as bold text: (root=aYj
)
Of course, there is also some bold text which does not represent an upasarga.
Thomas Malten's original digitization had markup for:
{@xxx@}
<div n="P">
or <div n="P1">
.Here is an extract from the digitization for aYj
:
<L>208<pc>0023<k1>aYj<k2>aYj
- Grundbedeutung „sch -->
+ Mit {@abhí,@} {%schm --> abhí,
+ {@ā́ @}1) die Bahn [ --> ā́
+ {@ní,@} {%hinuntersc --> ní,
+ {@prá,@} jemandem [D --> prá,
+ {@práti,@} {%schmück --> práti,
+ {@ví,@} med. 1) {%si --> ví,
+ {@sám@} 1) womit [I. --> sám
- -nákti 7) vām 153,2 -->
The file 'preverb0_dbg.txt' contains similar extract information for all the verbs.
The result 'gra_upasarga_map.txt' provides a mapping from bold upasarga text to SLP1 upasarga
spellings.
This mapping can then be used to generate a list of upasargas for each verb.
This is in file gra_preverb0.txt. The line for 'aYj' is:
;; Case 0010: L=208, k1=aYj, #upasargas=7, upasargas=aBi,A,ni,pra,prati,vi,sam
Finally, the two pieces:
can be combined to produce gra_preverb1.txt.
(same) means the cae headword spelling is the same as the spelling of the MW entry believed to correspond to the cae entry (705 cases)
Hope cae
is a copy-paste byproduct.
An example where GRA root spelling corresponds to a different MW root spelling:
GRA in his preface explains what conventions with dhatus he uses. Remember, we even made an English draft translation for the German Preface?
examined for several EXCLUSION patterns
Adorable intellectual work behind the patterns.
scanned entries suggests that upasargas appear as bold text
In original books it's a different font as well, not only bolded.
This mapping can then be used to generate a list of upasargas for each verb. ;; Case 0010: L=208, k1=aYj, #upasargas=7, upasargas=aBi,A,ni,pra,prati,vi,sam
Please advise me where to find a list of upasarga & dhatus combined only? pure list, seems gra_preverb1.txt
will be the one. Thanks again, as I use GRA a lot.
Hope cae is a copy-paste byproduct.
Yes, now corrected.
examined for several EXCLUSION patterns
Adorable intellectual work behind the patterns.
I've adopted a different approach altogether (and marked the entries with a √ symbol), and the summary is as below--
AB extra (L-entry): 614, 1055, 2103, 2436, 3501, 4000, 4285, 4288, ++6376.1, 6528, 6675.1, 7590, 7952, ++8788.1, 8803.1, 8954, 8955, 9602, 10336 [19] [Some of these are present in gra_verb_exclude.txt by Jim.]
Jim extra (L-entry):
1841, 2273, 2429, 3379, 4893, 5081, 5423, 7341, 7735.1, 7947, 8937, 9415, 10238 [13]
---------------------------
Next, I had reviwed the Jim's "extra" entries as found above, and noted the following--
Non-verbals:
The verbs01 directory aims
The comments here will focus on the gra_preverb1 report. gra_preverb1_deva is a Devanagari version of the report.
Currently, 907 of the 10777 entries of Grassman are identifed as verbs. 461 of these verbs have upasargas, and a total of 2026 upasargas (simple or compound) are identified.