Closed drdhaval2785 closed 8 years ago
1 ANUKRAM.zuR2V
pw.txt has only one entry with ¯ANUKR i.e. ¯ANUKR. ‹zu› ¯R2V.3,33 ‹bei› ¯SA7J.
Full entry
<H1>100{uttitIrzu}1{uttitIrzu}¦ •Adj. ²1) {%herauszusteigen ~ , herauszukommen beabsichtigend aus%} (•Abl.) ¯Ind.St.9,148. ²2) {%überzusetzen beabsichtigend%} ¯ANUKR. ‹zu› ¯R2V.3,33 ‹bei› ¯SA7J. {%über%} (•Acc.). PW18320
pw.txt has ¯ANUKR. ‹zu› ¯R2V.
Therefore, pwbib0.txt has to be changed from ANUKRAM. zu R2V.
->ANUKR. zu R2V.
See
2 KA7TJ(A7JANA)
There is no entry matching ¯KA7TJA in pw.txt And those with ¯KA7TJ have something suffixed. e.g.
.KA7TJ.C2R. == KA7TJA7JANA'S C2RAUTSU7TRA, nach WEBER'S Index. (vol. 1)
.KA7TJ. DH. == KA7TJA7JANA'S DHARMAC2A7STRA (STENZILER). (vol. 1)
.KA7TJ. SANA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1)
They are already mentioned in pwbib0.txt
It seems that the pwbib0.txt has mentioned KA7TJ(A7JANA)
just to give us an idea what this short form KA7TJ
in these compounds stand for.
unused list.
Not a separate resource.
3 KA7TJ.SANA7NAS The work is kAtyAyana snAnasUtra (not sanAna) Also there is unwanted space before SNANAS
So, typo in pwbib0.txt
.KA7TJ. SANA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1)
-> .KA7TJ.SNA7NAS. == KA7TJA7JANA'S SNA7NASU7TRA (STENZLER). (vol. 1)
4 DRAVJAC2 Not found in pw.txt
Only entry with DRAVJA is ¯DRAVJAV
.
See
<H1>100{agraja}1{agraja}¦ ²1) •Adj. {%zuerst geboren.%} ²2) •m. ³a) {%ein älterer Bruder.%} ³b) {%ein Brahman.%} ³c) {%*eine Krähenart%} ¯DRAVJAV. ²3) •*f. #{A} {%eine ältere Schwester.%} PW732
ANUKR should be forcemached to this.
Agree.
It seems that the pwbib0.txt has mentioned KA7TJ(A7JANA) just to give us an idea what this short form KA7TJ in these compounds stand for.
unused list. Not a separate resource.
It's an abstraction as root is. When the work started nobody was aware what exactly works will be used and forgot in vol 7 how they quoted in vol 2 after 22 years of work.
Agree.
5 SADDH.P.4 The pwbib0.txt entry reads
.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA, lithographirt in Parabole de l'Enfant egare. Par. PH. ED. FOUCAUX. Paris. 1854. (vol. 1)
It means that this is 4th chapter of saddharma...
When I tested with ¯SADDH[.]P[.]
in pw.txt, there are 55 such cases.
And with ¯SADDH[.]P[.]4
there are only 6 matches.
We should change .SADDH. P. 4
to .SADDH.P.` in pwbib0.txt.
Also there is a typo in the full form.
So final correction is
.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA,
-> .SADDH.P. == SADDHARMAPUD2N2ARIKA,
Removed the 4th chapter reference.
6 K4ANDRA7LOKA
Not used.
even ¯K4ANDR
didn't match anything in pw.txt
7 HIT.ed.JOHNS
¯HIT.Ed.JOHNS.
->¯HIT.ed.JOHNS.
in pw.txt needed. small letter 'ed.'
<H1>000{aBinavayOvana}1{aBinavayOvana}¦ (¯HIT.Ed.JOHNS.1336. ‹und› #{aBinavavayaska} (¯Chr.148,12) •Adj. (•f. #{A}) {%in der ersten Jugend stehend.%} PW7402
8 ka7tj Same as member 2. Small letter version
9 SAM5KSHEPAC2
Neither SAM5K
nor SAN5K
found in pw.txt
unused
10 ALAM5KA7RAV ¯ALAM5KA7DRAV -> ¯ALAM5KA7RAV in pw.txt
<H1>100{samutpuMsana}1{samutpuMsana}¦ •n. {%das Wegwischen , Entfernen , zu Nichte Machen%} ¯ALAM5KA7DRAV.138, ‹a.› PW120638
11 ALAM5KA7RAR
not used
See the matches (case insensitive) of ALAM5KA7R
. None matches alaMkAraratnAkara.
Line 78588: <H1>100{cakravartin}1{cakravartin}¦ ²1) •Adj. {%auf Rädern rollend.%} ²2) •m. ³a) {%dessen Räder ungehemmt über die Länder rennen , Weltherrscher%} ; ‹Herrscher über (im Comp. vorangehend)› ¯VIKRAMA7N5KAK4.1031. ³b) {%Oberhaupt%} ‹überh.› , {%der unter%} ‹--› (•Gen. ‹oder im Comp. vorangehend)› {%der unter%} ‹--› (•Gen. ‹oder im Comp. vorangehend)› {%den höchsten Rang einnimmt%} ; ‹auch von Unbelebtem.› ³c) {%®*Chenopodium_album%} ¯RA7G4AN.7,121. ³d) ‹N.pr. eines Commentators des †Alam5ka7rasarvasva› ¯KUMA7RASV. ‹zu› ¯PRATA7PAR.46,27. ‹u.s.w.› ²3) •f. #{°nI} ³a) •f. ‹zu 2)b).› ³b) {%*eine best. wohlriechende Pflanze%} , = #{jantukA} ¯RA7G4AN.3,114. ³c) {%®*Nardostachys_Jatamansi%} ¯RA7G4AN.12,98. ³d) {%*rother Lack%} ¯RA7G4AN.6,209. PW38291
Line 193076: <H1>500{rAjahaMsIy}1{rAjahaMsIy}¦ , #{°yati} {%sich wie eine%} #{rAjahaMsI} {%gebaren%} ¯ALAM5KA7RAT.15, ‹b.› PW93588
Line 223036: <H1>100{vEbuDa}1{vEbuDa}¦ •Adj. (•f. #{I}) {%den Göttern eigen , göttlich%} ¯C2IC2.18,63.¯ALAM5KA7RAT.1, ‹b.› PW107796
Line 249444: <H1>100{samuttejana}1{samuttejana}¦ •n. {%das Anfeuern , Aufreizen%} ¯MUDRA7R.55,19…(88,12).¯ALAM5KA7RAC2.17,b,1. PW120625
Line 276527: <H1>100{harahAra}1{harahAra}¦ •m. {%†C2iva's Perlenschmuck%} , ‹Bez. des Schlangendämons †C2esha› ¯VA7GAHAT2A ‹in› ¯ALAM5KA7RAT.2, {%a.%} PW133617
Line 276541: <H1>100{harAvAsa}1{harAvAsa}¦ •m. ²1) ‹Civa's› {%Wohnstätte.%} ²2) ‹Bein.› ‹des †Kaila7sa› ¯VA7GBHAT2A ‹in› ¯ALAM5KA7RAT.2, ‹a› PW133624
Line 278114: <H1>100{hAwakIya}1{hAwakIya}¦ •Adj. ‹dass.› ¯ALAM5KA7RAC2.18,a,1 ‹nach› ¯AUFRECHT. PW134362
12 DEVATA7DHJ.BRA7HM
¯D4VATA7DHJ.bra7hm. -> ¯DEVATA7DHJ.BRA7HM. in pw.txt
<H1>100{utsnAna}1{utsnAna}¦ •n. {%das Auftauchen aus dem Wasser%} ¯D4VATA7DHJ.bra7hm.3. PW18547
13 A7TMOPAN
¯ATMOPAN -> ¯A7TMOPAN in pw.txt
<H1>100{avarRya}1{avarRya}¦ •Adj. ²1) {%nicht zu beschreiben%} ¯ATMOPAN.3 ²2) {%nicht auszusagen , ~ zu prädiciren%} •Comm. ‹zu› ¯NJA7JAS.5,1,4. PW10616
@funderburkjim This ends correction submission in this thread. Hopefully all of this file dies or matches after this correction.
.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARIKA, -> .SADDH.P. == SADDHARMAPUD2N2ARIKA,
- I'm sure you meant ...I7KA = sadDarmapuRqarIka for the correction.
- Regarding removal of the '.4' in pwbib0, I disagree.
- There is already a 'SADDH.P' in pwbib0:
```
.SADDH. P. == SADDHARMAPUN2D2ARI7KA (KERN) . (vol. 4)
.SADDH. P. 4 == das 4te Kapitel des SADDHARMAPUD2N2ARI7KA, lithographirt in Parabole de l'Enfant egare. Par. PH. ED. FOUCAUX. Paris. 1854. (vol. 1
```
SADDH.P.4,
although one instance, under Sram has a typo in pw.txt of a period instead of that comma
after the '4'. ¯SADDH.P.4.19.@¯SADDH.P.4,19.@t@ period to comma after '4'
Re 8 ka7tj
small letter version:
.KA7TJ (A7JANA'S)KARM (APRADI7PA) (STENZLER). (vol. 5) .KA7TJ (A7JANA'S) PARIBH (A7SHA7S) IN 'Zwölf Hymnen des Rigveda, herausgegeben von ERNST WIDISCH'S. 169. fgg. (vol. 5)
The new abbreviations are KA7TJ.KARM
and KA7TJ.PARIBH
.Re 13 A7TMOPAN
I don't find this in pw.txt, but , but A7TMOPAN is found under 'avarRya' So this correction must have been previously corrected.
The bibminuscref list of 13 above and the current list ( #36 ) are identical, except for the last item.
.KAP. S. == KAPISHT2HALAKAT2HASAM5HITA, Hdschr. (V. SCHRÖDER). (vol. 2)
It was necessary to adjust abbrv.py so that this 'S' would NOT be dropped.
Here are the pw.txt corrections from this issue. not yet installed
; hw = Sram
¯SADDH.P.4.19.@¯SADDH.P.4,19.@t@ period to comma after '4 '
; hw = aBinavayOvana
¯HIT.Ed.JOHNS.1336.@¯HIT.ed.JOHNS.1336.@t@
; hw = samutpuMsana
¯ALAM5KA7DRAV.138, ‹a.›@¯ALAM5KA7RAV.138,a.@t@
; hw = utsnAna
¯D4VATA7DHJ.bra7hm.3.@¯DEVATA7DHJ.BRA7HM.3.@t@
; hw = uttitIrzu
¯ANUKR. ‹zu› ¯R2V.3,33@¯ANUKR. ‹zu› ¯R2V.3,33@t@ markup
@funderburkjim I just want to confirm how the code works in case the correction is found in more than one entries ? I would like it to replace all the occurrences.
Corrections now installed.
bibminuscref now has 0 records!
There are now 66722 records of abbrvlist.txt that match out of 72758.0 (91.7%).
Part of crefmatch.py identifies records of pwbib0 as unused. The list (as shown in crefmatch_log.txt) is
31 known unused records removed from pwbib for purposes of matching
Case 01: gan2a : im Gaṇapātha zu P.
Case 02: HARISV. : HARISVĀMIN.
Case 03: MAHA7B. : MAHĀBHĀṢYA, lith. in Benares.
Case 04: MAYR, Ind. Erb. : Das indische Erbrecht von AUREL MAYR, Wien 1873.
Case 05: NI7LAK. mit einer Zahl : NĪL.
Case 06: BÜHLER, Rep. 1872-73 : BÜHLER, Report on Sanskrit Mss. 1872--73.
Case 07: C2OBH. : COBHANAMUNI'S ŚOBHANASTUTAYAS in Z. d. d. m. G. 32, 509. fgg. (YACOBI).
Case 08: Ind.Str. : WEBER, Indische Streifen.
Case 09: MAITR. PADDH. : PADDHATI ZU MAITR. S. , Hdschr. (V. SCHRÖDER).
Case 10: GAN2ITA,MADHJA7M(A7DHJA7JA) : (KERN).
Case 11: KA7TJ(A7JANA) : nach Citaten in verschiedenen juristischen Werken(YOLLY).
Case 12: KA7TJ. C2RA7DDHAK. : KĀTYĀYANA'S ŚRĀDDHAKALPASŪTRA (STENZLER).
Case 13: PRAG4A7PATI : , nach Citaten in verschiendenen juristischen Werken (YOLLY).
Case 14: KIELHORN, Rep. : Report on the search for Sanskrit Mss. in the Bombay Presidency, during the year 1880-81. by F. KIELHORN(ROTH) .
Case 15: SVAPNAK4(INTA7MAN2I) : (PISCHEL) .
Case 16: ALAM5KA7RAR : (ATNĀKARA) des ŚOBHĀKARA, Deccan. Coll. Hdschr. No. 227 (PISCHER).
Case 17: ALAM5KA7RAS : (ARVASVA) von RUYYAKA, Deccan Coll. Hdschr. No. 231 (PISCHER).
Case 18: C2RIMA7LA7M : (ĀHĀTMYA), Hdschr. (AUFRECHT).
Case 19: DEVI7BHA7G : (AVATAPURĀṆA), Hdschr. (AUFRECHT).
Case 20: DRAVJAC2 : (UDDHITATTVA), Hdschr. (AUFRECHT).
Case 21: KA7VJA7L : (OKALOCANA), Hdschr. (AUFRECHT).
Case 22: KHAN2D2APR : (AŚASTI) im Pandit, Bed. 5 und 6 (AUFRECHT).
Case 23: K4HA7NDOGJAP : (ARIŚIṢṬA) nach Citaten in andern Werken (AUFRECHT).
Case 24: LEUMANN, Aup. Gl. : Glossar in. , Das Aupapātika Sūtra, herausgegeben von Dr. ERNST LEUMANN.
Case 25: SAM5KSHEPAC2 : (AM̃KARAJAYA) von MĀDHAVA (AUFRECHT).
Case 26: VA7MANAP : (URĀṆA), Hdschr. (AUFRECHT).
Case 27: VA7RA7HAP : (URĀNA) , Hdschr. (YACOBI).
Case 28: Bydragen : (tot de Taal-, Land-en Volkenkunde van Nederlandsch Indie) (H. KERN).
Case 29: K4ANDRA7LOKA : (R. PISCHEL).
Case 30: KA7VJA7L : (AM̃KĀRA des RUDRAṬA) (R. PISCHEL).
Case 31: SAHR2DAJA7LOKA : (R. PISCHEL).
Probably ok to close this issue.
Will continue with #38 tomorrow.
Happy New Year!
bibminuscref now has 0 records!
Glad to see at least one file reaching its logical conclusion. Closing this issue.
Also updated the submission files and github.io files after regeneration as on 1.1.2016
Happy new year.
31 sources stated in Preface and not used in Book, @funderburkjim ? How many different sources in total used in PWK?
Re how the code works in case the correction is found in more than one entries
.
Based on your comment several issues ago, I realized that your convention in submitting corrections was that a correction is to be thought of as 'generic', in the sense that there may be more instances needing correction than the specific one shown in the correction. Thus, I have been doing a 'change preparation step', which takes your corrections as input, and searches for all matches in pw.txt, regardless of the headword, and generates change records for all such.
@gasyoun Re How many different sources in total used in PWK?
According to crefmatch_log.txt file, there are 502 pwbib records, before adjustment. From this are removed:
17 records whose abbreviations are counted as duplicates:
bib duplicate abbrv: DHU7RTAS 082 322
bib duplicate abbrv: HA7SJ 113 335
bib duplicate abbrv: KAP 137 343
bib duplicate abbrv: KA7T2H 141 346
bib duplicate abbrv: KA7D 130 389
bib duplicate abbrv: K4AURAP 147 394
bib duplicate abbrv: KUMA7RAS 160 396
bib duplicate abbrv: MBH 189 398
bib duplicate abbrv: RA7G4AN 229 427
bib duplicate abbrv: BAUDH 308 437
bib duplicate abbrv: BHAVISHJOTT.P 036 442
bib duplicate abbrv: HARSHAK4 112 454
bib duplicate abbrv: KA7TJ 458 459
bib duplicate abbrv: VASISHT2HA 411 482
bib duplicate abbrv: C2A7K 051 488
bib duplicate abbrv: SARASVATI7K 476 498
bib duplicate abbrv: VA7GBH 273 501
This results in 454 records from pwbib which are 'available for use' in matching to references in pw.txt.
Exactly how many of these actually DO have a match in some pw.txt reference I don't currently know; this might be a useful statistic to generate sometime.
regardless of the headword, and generates change records for all such.
Bingo!
@funderburkjim thanks for 454. It's interesting to see that PD tried to get over 500 and Werba tried as well - and the number is always about 500. Was not aware of bib duplicate abbrv:
.
After the completion of 4 parts and some force matching, the bibminuscref.txt file has only 13 entries as of now.
This issue is dedicated to their resolution, so that the problem may be sent to its logical conclusion.