sanskrit-lexicon / PWK

Sanskrit-Wörterbuch in kürzerer Fassung, 7 Bände Petersburg 1879-1889
3 stars 1 forks source link

bib minus cref, part 5 #37

Closed drdhaval2785 closed 8 years ago

drdhaval2785 commented 8 years ago

After the completion of 4 parts and some force matching, the bibminuscref.txt file has only 13 entries as of now.

ANUKRAM.zuR2V
KA7TJ(A7JANA)
KA7TJ.SANA7NAS
DRAVJAC2
SADDH.P.4
K4ANDRA7LOKA
HIT.ed.JOHNS
ka7tj
SAM5KSHEPAC2
ALAM5KA7RAV
ALAM5KA7RAR
DEVATA7DHJ.BRA7HM
A7TMOPAN

This issue is dedicated to their resolution, so that the problem may be sent to its logical conclusion.

drdhaval2785 commented 8 years ago

1 ANUKRAM.zuR2V

pw.txt has only one entry with ¯ANUKR i.e. ¯ANUKR. ‹zu› ¯R2V.3,33 ‹bei› ¯SA7J. Full entry

<H1>100{uttitIrzu}1{uttitIrzu}¦ •Adj. ²1) {%herauszusteigen ~ , herauszukommen beabsichtigend aus%} (•Abl.) ¯Ind.St.9,148. ²2) {%überzusetzen beabsichtigend%} ¯ANUKR. ‹zu› ¯R2V.3,33 ‹bei› ¯SA7J. {%über%} (•Acc.). PW18320

pw.txt has ¯ANUKR. ‹zu› ¯R2V. Therefore, pwbib0.txt has to be changed from ANUKRAM. zu R2V.->ANUKR. zu R2V. See capture

drdhaval2785 commented 8 years ago

2 KA7TJ(A7JANA)

There is no entry matching ¯KA7TJA in pw.txt And those with ¯KA7TJ have something suffixed. e.g.


.KA7TJ.C2R. == KA7TJA7JANA'S C2RAUTSU7TRA, nach WEBER'S Index. (vol. 1)
.KA7TJ. DH. == KA7TJA7JANA'S DHARMAC2A7STRA (STENZILER). (vol. 1)
.KA7TJ. SANA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1)

They are already mentioned in pwbib0.txt

It seems that the pwbib0.txt has mentioned KA7TJ(A7JANA) just to give us an idea what this short form KA7TJ in these compounds stand for. unused list. Not a separate resource.

drdhaval2785 commented 8 years ago

3 KA7TJ.SANA7NAS The work is kAtyAyana snAnasUtra (not sanAna) Also there is unwanted space before SNANAS

So, typo in pwbib0.txt .KA7TJ. SANA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1) -> .KA7TJ.SNA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1)

drdhaval2785 commented 8 years ago

4 DRAVJAC2 Not found in pw.txt

Only entry with DRAVJA is ¯DRAVJAV. See

<H1>100{agraja}1{agraja}¦ ²1) •Adj. {%zuerst geboren.%} ²2) •m. ³a) {%ein älterer Bruder.%} ³b) {%ein Brahman.%} ³c) {%*eine Krähenart%} ¯DRAVJAV. ²3) •*f. #{A} {%eine ältere Schwester.%} PW732
gasyoun commented 8 years ago

ANUKR should be forcemached to this.

Agree.

It seems that the pwbib0.txt has mentioned KA7TJ(A7JANA) just to give us an idea what this short form KA7TJ in these compounds stand for.

unused list. Not a separate resource.

It's an abstraction as root is. When the work started nobody was aware what exactly works will be used and forgot in vol 7 how they quoted in vol 2 after 22 years of work.

Agree.

drdhaval2785 commented 8 years ago

5 SADDH.P.4 The pwbib0.txt entry reads

.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA, lithographirt in Parabole de l'Enfant egare. Par. PH. ED. FOUCAUX. Paris. 1854. (vol. 1)

It means that this is 4th chapter of saddharma...

When I tested with ¯SADDH[.]P[.] in pw.txt, there are 55 such cases. And with ¯SADDH[.]P[.]4 there are only 6 matches.

We should change .SADDH. P. 4 to .SADDH.P.` in pwbib0.txt. Also there is a typo in the full form.

So final correction is .SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA, -> .SADDH.P. == SADDHARMAPUD2N2ARIKA,

Removed the 4th chapter reference.

drdhaval2785 commented 8 years ago

6 K4ANDRA7LOKA

Not used. even ¯K4ANDR didn't match anything in pw.txt

drdhaval2785 commented 8 years ago

7 HIT.ed.JOHNS

¯HIT.Ed.JOHNS.->¯HIT.ed.JOHNS. in pw.txt needed. small letter 'ed.'

<H1>000{aBinavayOvana}1{aBinavayOvana}¦ (¯HIT.Ed.JOHNS.1336. ‹und› #{aBinavavayaska} (¯Chr.148,12) •Adj. (•f. #{A}) {%in der ersten Jugend stehend.%} PW7402
drdhaval2785 commented 8 years ago

8 ka7tj Same as member 2. Small letter version

drdhaval2785 commented 8 years ago

9 SAM5KSHEPAC2 Neither SAM5K nor SAN5K found in pw.txt

unused

drdhaval2785 commented 8 years ago

10 ALAM5KA7RAV ¯ALAM5KA7DRAV -> ¯ALAM5KA7RAV in pw.txt

<H1>100{samutpuMsana}1{samutpuMsana}¦ •n. {%das Wegwischen , Entfernen , zu Nichte Machen%} ¯ALAM5KA7DRAV.138, ‹a.› PW120638
drdhaval2785 commented 8 years ago

11 ALAM5KA7RAR

not used

See the matches (case insensitive) of ALAM5KA7R. None matches alaMkAraratnAkara.

    Line 78588: <H1>100{cakravartin}1{cakravartin}¦ ²1) •Adj. {%auf Rädern rollend.%} ²2) •m. ³a) {%dessen Räder ungehemmt über die Länder rennen , Weltherrscher%} ; ‹Herrscher über (im Comp. vorangehend)› ¯VIKRAMA7N5KAK4.1031. ³b) {%Oberhaupt%} ‹überh.› , {%der unter%} ‹--› (•Gen. ‹oder im Comp. vorangehend)› {%der unter%} ‹--› (•Gen. ‹oder im Comp. vorangehend)› {%den höchsten Rang einnimmt%} ; ‹auch von Unbelebtem.› ³c) {%®*Chenopodium_album%} ¯RA7G4AN.7,121. ³d) ‹N.pr. eines Commentators des †Alam5ka7rasarvasva› ¯KUMA7RASV. ‹zu› ¯PRATA7PAR.46,27. ‹u.s.w.› ²3) •f. #{°nI} ³a) •f. ‹zu 2)b).› ³b) {%*eine best. wohlriechende Pflanze%} , = #{jantukA} ¯RA7G4AN.3,114. ³c) {%®*Nardostachys_Jatamansi%} ¯RA7G4AN.12,98. ³d) {%*rother Lack%} ¯RA7G4AN.6,209. PW38291
    Line 193076: <H1>500{rAjahaMsIy}1{rAjahaMsIy}¦ , #{°yati} {%sich wie eine%} #{rAjahaMsI} {%gebaren%} ¯ALAM5KA7RAT.15, ‹b.› PW93588
    Line 223036: <H1>100{vEbuDa}1{vEbuDa}¦ •Adj. (•f. #{I}) {%den Göttern eigen , göttlich%} ¯C2IC2.18,63.¯ALAM5KA7RAT.1, ‹b.› PW107796
    Line 249444: <H1>100{samuttejana}1{samuttejana}¦ •n. {%das Anfeuern , Aufreizen%} ¯MUDRA7R.55,19…(88,12).¯ALAM5KA7RAC2.17,b,1. PW120625
    Line 276527: <H1>100{harahAra}1{harahAra}¦ •m. {%†C2iva's Perlenschmuck%} , ‹Bez. des Schlangendämons †C2esha› ¯VA7GAHAT2A ‹in› ¯ALAM5KA7RAT.2, {%a.%} PW133617
    Line 276541: <H1>100{harAvAsa}1{harAvAsa}¦ •m. ²1) ‹Civa's› {%Wohnstätte.%} ²2) ‹Bein.› ‹des †Kaila7sa› ¯VA7GBHAT2A ‹in› ¯ALAM5KA7RAT.2, ‹a› PW133624
    Line 278114: <H1>100{hAwakIya}1{hAwakIya}¦ •Adj. ‹dass.› ¯ALAM5KA7RAC2.18,a,1 ‹nach› ¯AUFRECHT. PW134362
drdhaval2785 commented 8 years ago

12 DEVATA7DHJ.BRA7HM

¯D4VATA7DHJ.bra7hm. -> ¯DEVATA7DHJ.BRA7HM. in pw.txt

<H1>100{utsnAna}1{utsnAna}¦ •n. {%das Auftauchen aus dem Wasser%} ¯D4VATA7DHJ.bra7hm.3. PW18547
drdhaval2785 commented 8 years ago

13 A7TMOPAN

¯ATMOPAN -> ¯A7TMOPAN in pw.txt

<H1>100{avarRya}1{avarRya}¦ •Adj. ²1) {%nicht zu beschreiben%} ¯ATMOPAN.3 ²2) {%nicht auszusagen , ~ zu prädiciren%} •Comm. ‹zu› ¯NJA7JAS.5,1,4. PW10616
drdhaval2785 commented 8 years ago

@funderburkjim This ends correction submission in this thread. Hopefully all of this file dies or matches after this correction.

funderburkjim commented 8 years ago

.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA, -> .SADDH.P. == SADDHARMAPUD2N2ARIKA,

  • I'm sure you meant ...I7KA = sadDarmapuRqarIka for the correction.
  • Regarding removal of the '.4' in pwbib0, I disagree.
  • There is already a 'SADDH.P' in pwbib0:
```
.SADDH. P. == SADDHARMAPUN2D2ARI7KA (KERN) . (vol. 4)

.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARI7KA, lithographirt in Parabole de l'Enfant egare. Par. PH. ED. FOUCAUX. Paris. 1854. (vol. 1

```
funderburkjim commented 8 years ago

Re 8 ka7tj small letter version:

funderburkjim commented 8 years ago

Re 13 A7TMOPAN I don't find this in pw.txt, but , but A7TMOPAN is found under 'avarRya' So this correction must have been previously corrected.

funderburkjim commented 8 years ago

The bibminuscref list of 13 above and the current list ( #36 ) are identical, except for the last item.

36 has 'KAP.S' as the last item:

.KAP. S. == KAPISHT2HALAKAT2HASAM5HITA, Hdschr. (V. SCHRÖDER). (vol. 2)

It was necessary to adjust abbrv.py so that this 'S' would NOT be dropped.

funderburkjim commented 8 years ago

Here are the pw.txt corrections from this issue. not yet installed

; hw = Sram
¯SADDH.P.4.19.@¯SADDH.P.4,19.@t@ period to comma after '4  '    
; hw = aBinavayOvana
¯HIT.Ed.JOHNS.1336.@¯HIT.ed.JOHNS.1336.@t@ 
; hw = samutpuMsana
¯ALAM5KA7DRAV.138, ‹a.›@¯ALAM5KA7RAV.138,a.@t@
; hw = utsnAna
¯D4VATA7DHJ.bra7hm.3.@¯DEVATA7DHJ.BRA7HM.3.@t@
; hw = uttitIrzu
¯ANUKR. ‹zu› ¯R2V.3,33@¯ANUKR. ‹zu› ¯R2V.3,33@t@ markup
drdhaval2785 commented 8 years ago

@funderburkjim I just want to confirm how the code works in case the correction is found in more than one entries ? I would like it to replace all the occurrences.

funderburkjim commented 8 years ago

Corrections now installed.

bibminuscref now has 0 records!

There are now 66722 records of abbrvlist.txt that match out of 72758.0 (91.7%).

funderburkjim commented 8 years ago

Part of crefmatch.py identifies records of pwbib0 as unused. The list (as shown in crefmatch_log.txt) is

31 known unused records removed from pwbib for purposes of matching
Case 01: gan2a : im Gaṇapātha zu P. 
Case 02: HARISV. : HARISVĀMIN. 
Case 03: MAHA7B. : MAHĀBHĀṢYA, lith. in Benares. 
Case 04: MAYR, Ind. Erb. : Das indische Erbrecht von AUREL MAYR, Wien 1873. 
Case 05: NI7LAK. mit einer Zahl : NĪL. 
Case 06: BÜHLER, Rep. 1872-73 : BÜHLER, Report on Sanskrit Mss. 1872--73. 
Case 07: C2OBH. : COBHANAMUNI'S ŚOBHANASTUTAYAS in Z. d. d. m. G. 32, 509. fgg. (YACOBI). 
Case 08: Ind.Str. : WEBER, Indische Streifen. 
Case 09: MAITR. PADDH. : PADDHATI ZU MAITR. S. , Hdschr. (V. SCHRÖDER). 
Case 10: GAN2ITA,MADHJA7M(A7DHJA7JA) : (KERN). 
Case 11: KA7TJ(A7JANA) : nach Citaten in verschiedenen juristischen Werken(YOLLY). 
Case 12: KA7TJ. C2RA7DDHAK. : KĀTYĀYANA'S ŚRĀDDHAKALPASŪTRA (STENZLER). 
Case 13: PRAG4A7PATI : , nach Citaten in verschiendenen juristischen Werken (YOLLY). 
Case 14: KIELHORN, Rep. : Report on the search for Sanskrit Mss. in the Bombay Presidency, during the year 1880-81. by F. KIELHORN(ROTH) . 
Case 15: SVAPNAK4(INTA7MAN2I) : (PISCHEL) . 
Case 16: ALAM5KA7RAR : (ATNĀKARA) des ŚOBHĀKARA, Deccan. Coll. Hdschr. No. 227 (PISCHER). 
Case 17: ALAM5KA7RAS : (ARVASVA) von RUYYAKA, Deccan Coll. Hdschr. No. 231 (PISCHER). 
Case 18: C2RIMA7LA7M : (ĀHĀTMYA), Hdschr. (AUFRECHT). 
Case 19: DEVI7BHA7G : (AVATAPURĀṆA), Hdschr. (AUFRECHT). 
Case 20: DRAVJAC2 : (UDDHITATTVA), Hdschr. (AUFRECHT). 
Case 21: KA7VJA7L : (OKALOCANA), Hdschr. (AUFRECHT). 
Case 22: KHAN2D2APR : (AŚASTI) im Pandit, Bed. 5 und 6 (AUFRECHT). 
Case 23: K4HA7NDOGJAP : (ARIŚIṢṬA) nach Citaten in andern Werken (AUFRECHT). 
Case 24: LEUMANN, Aup. Gl. : Glossar in. , Das Aupapātika Sūtra, herausgegeben von Dr. ERNST LEUMANN. 
Case 25: SAM5KSHEPAC2 : (AM̃KARAJAYA) von MĀDHAVA (AUFRECHT). 
Case 26: VA7MANAP : (URĀṆA), Hdschr. (AUFRECHT). 
Case 27: VA7RA7HAP : (URĀNA) , Hdschr. (YACOBI). 
Case 28: Bydragen : (tot de Taal-, Land-en Volkenkunde van Nederlandsch Indie) (H. KERN). 
Case 29: K4ANDRA7LOKA : (R. PISCHEL). 
Case 30: KA7VJA7L : (AM̃KĀRA des RUDRAṬA) (R. PISCHEL). 
Case 31: SAHR2DAJA7LOKA : (R. PISCHEL). 
funderburkjim commented 8 years ago

Probably ok to close this issue.

funderburkjim commented 8 years ago

Will continue with #38 tomorrow.

Happy New Year!

drdhaval2785 commented 8 years ago

bibminuscref now has 0 records!

Glad to see at least one file reaching its logical conclusion. Closing this issue.

Also updated the submission files and github.io files after regeneration as on 1.1.2016

Happy new year.

gasyoun commented 8 years ago

31 sources stated in Preface and not used in Book, @funderburkjim ? How many different sources in total used in PWK?

funderburkjim commented 8 years ago

Re how the code works in case the correction is found in more than one entries.

Based on your comment several issues ago, I realized that your convention in submitting corrections was that a correction is to be thought of as 'generic', in the sense that there may be more instances needing correction than the specific one shown in the correction. Thus, I have been doing a 'change preparation step', which takes your corrections as input, and searches for all matches in pw.txt, regardless of the headword, and generates change records for all such.

funderburkjim commented 8 years ago

@gasyoun Re How many different sources in total used in PWK?

According to crefmatch_log.txt file, there are 502 pwbib records, before adjustment. From this are removed:

This results in 454 records from pwbib which are 'available for use' in matching to references in pw.txt.

Exactly how many of these actually DO have a match in some pw.txt reference I don't currently know; this might be a useful statistic to generate sometime.

drdhaval2785 commented 8 years ago

regardless of the headword, and generates change records for all such.

Bingo!

gasyoun commented 8 years ago

@funderburkjim thanks for 454. It's interesting to see that PD tried to get over 500 and Werba tried as well - and the number is always about 500. Was not aware of bib duplicate abbrv:.