Closed drdhaval2785 closed 8 years ago
92 SARASVATIK->SARASVATI7K See the capital I
93 C2UKRN->C2UKRAN SukranIti
94 KULA7RN2AVA,->KULA7RN2AVA No comma
95 <ls>SCHIEFNER.,</ls><ls>TA7RAN.</ls>
-><ls>SCHIEFNER.,TA7RAN.</ls>
Total 4 occurrences.
Line 85016: <H1><h><key1>mahAvihAravAsin</key1><key2>mahAvihAravAsin</key2></h><body><gram n="m">m.</gram> <gram n="Pl">Pl.</gram> <i>eine best. buddhistische Secte</i> <ls>SCHIEFNER.,</ls><ls>TA7RAN.</ls> PW85013</body><tail><L>85012</L><pc>5052-3</pc></tail></H1>
Line 85183: <H1><h><key1>mahAsAMGika</key1><key2>mahAsAMGika</key2></h><body><gram n="m">m.</gram> <gram n="Pl">Pl.</gram> <i>eine best.</i> <noti>buddhistische Schule</noti> <ls>SCHIEFNER.,</ls><ls>TA7RAN.</ls> PW85180</body><tail><L>85179</L><pc>5054-3</pc></tail></H1>
Line 85207: <H1><h><key1>mahAsudarSana</key1><key2>*mahAsudarSana</key2></h><body><gram n="m">m.</gram> <noti>N.pr. eines K4akravartin</noti> <ls>SCHIEFNER.,</ls><ls>TA7RAN.</ls> PW85204</body><tail><L>85203</L><pc>5055-1</pc></tail></H1>
Line 90434: <H1><h><key1>yavadvIpa</key1><key2>yavadvIpa</key2></h><body><gram n="m">m.</gram> <i>die Insel</i> <noti>Java</noti> <ls>SCHIEFNER.,</ls><ls>TA7RAN.263.</ls> PW90431</body><tail><L>90430</L><pc>5132-1</pc></tail></H1>
See
¯SCHIEFNER.,¯TA7RAN.@mahAvihAravAsin@mahAvihAravAsin@85012:¯SCHIEFNER,TA7RAN.:t: ¯SCHIEFNER.,¯TA7RAN.@mahAsAMGika@mahAsAMGika@85179:¯SCHIEFNER,TA7RAN.:t: ¯SCHIEFNER.,¯TA7RAN.@mahAsudarSana@*mahAsudarSana@85203:¯SCHIEFNER,TA7RAN.:t: ¯SCHIEFNER.,¯TA7RAN.263.@yavadvIpa@yavadvIpa@90430:¯SCHIEFNER,TA7RAN.263.:t:
96 KA7C2IKH->KA7C2I7KH Capital I
97 WILSON,Sel.Spec No change
Missed in cref because of trailing Roman numerals (WILSON,Sel.Spec.LXXVIII). @funderburkjim, can you please update the regex in the script so that trailing Roman numerals are also removed from crefs?
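One way the requested cleanup could look is sketched below. This is not the regex in the actual script; the function name `strip_trailing_roman` and the rule "drop one final dot-separated component if it is a well-formed Roman numeral" are assumptions made for illustration.

```python
import re

# Strict Roman numeral (1..3999). The lookahead rejects the empty string,
# which the rest of the pattern would otherwise accept.
ROMAN = (r"(?=[MDCLXVI])M{0,3}(?:CM|CD|D?C{0,3})"
         r"(?:XC|XL|L?X{0,3})(?:IX|IV|V?I{0,3})")

# A final ".<roman>" component, optionally followed by one more dot.
TRAILING_ROMAN = re.compile(r"\.(?:" + ROMAN + r")\.?$")


def strip_trailing_roman(cref: str) -> str:
    """Hypothetical helper: drop a trailing Roman-numeral component."""
    return TRAILING_ROMAN.sub("", cref)


print(strip_trailing_roman("WILSON,Sel.Spec.LXXVIII"))
print(strip_trailing_roman("GAN2IT.GRAH.3."))
```

A caveat with this approach: a final component that is a single letter such as M, C or V also counts as a Roman numeral, so abbreviations ending in such a component would need to be whitelisted.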
98 VA7MANAP->VA7MANA There are 117 entries in total with VA7MANA. No entry has P at the end.
EJF at several places.
Thus, the VA7MANAP one should be added to the pwbib unused list.
¯Va7man.2,2,11.@ekArTa@ekArTa@22123:¯VA7MANA.2,2,11. :t: ¯VA7MANA.AM ‹Ende eines›@kuRqala@kuRqala@28461:¯VA7MANA. ‹Am Ende eines›:t: ¯Va7mana.1,3,7.@CandaHSAstra@CandaHSAstra@40924:¯VA7MANA.1,3,7.:t: ¯Va7mana.1,3,7.@Candoviciti@Candoviciti@40965:¯VA7MANA.1,3,7.:t: (¯VA7MANA.)@daSarUpaka@daSarUpaka@49367:(¯VA7MANA. ):t: ¯VA7MANA.S.47, ‹Z. 8.›@devIBAva@devIBAva@52688:¯VA7MANA.S.47,Z.8.:t: ¯VA7MANA.FEHLT ‹sie auch im›@mil@mil@87001:¯VA7MANA ‹fehlt sie auch im›:t:
99 SUCHA7SHITA7V->SUBHA7SHITA7V suBAzitAv
100 WEBER,->WEBER
No comma
I guess this has been accounted for in Jim's recent code update, where ',' was added to the cleanup regex.
EJF. Changed pwbib0 to .WEBER, (BHAG) (AVATI). => .WEBER , (BHAG) (AVATI). [add space]
Have not done any 'global' changes like this, though several similar individual changes.
101 ROXB->ROXBURGH.Flora Ind.
EJF. There is one 'ROXB' in abbrvlist, where a capitalization change in pw is needed:
¯Roxb.2,97)@ajamoda@ajamoda@1262:¯ROXB.2,97 ):t
The one mentioned here (ROXBURGH.Floraind.3,380) is clearly an inconsistency in PW.
Rather than changing pw to the consistent ROXB.3,380, I think it is better to adjust abbrv.py.
102 Bibl.ind No change Missed in crefs because of the small letter 'ind' at the end. Total 11 occurrences
103 RA7MAPU7RVAT.Up->RA7MAPU7RVAT.UP Already done
104 NR2SUp->NR2S.UP. Corrected in 'UP' corrections
105 HAUG,Acc->HAUG.Acc
EJF. Since scan of pwbib and scan of pw both have a comma, a correction is warranted in the two pw cases.
¯HAUG.Acc.58.@mantrajAgara@mantrajAgara@83080:¯HAUG,Acc.58.:t
¯HAUG.Acc.59.@ATAyinI@ATAyinI@14326:¯HAUG,Acc.59.:t
106 APABA7KA Not found in pw.xml
EJF An error in pwbib0: .APABA7KA (JOLLY). => .APARA7RKA (JOLLY).
Occurs 3 times in abbrvlist.
Some 22 more entries. In this issue I will complete the correction submissions for 'ls' tags. Let us see how well pwbib0.txt and sortedcrefs.txt match. Praying for 90%.
We are one. KA7TJ.C2RA7DDHAK not found - means an unused, but declared abbreviation?
EJF. Exactly. I call the list of such examples the 'pwbib unused' list.
and KA7TJ.C2RA7DDHAK was added to that list.
107 C2A7KTA7N(ANDATARAM5GIN2I)->C2A7KTA7N pwbib0 expansion.
EJF Added a space in pwbib0, so that crefmatch will treat C2A7KTA7N as the abbreviation to match.
.C2A7KTA7N(ANDATARAM5GIN2I) => .C2A7KTA7N (ANDATARAM5GIN2I)
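Why the added space matters can be illustrated with a small sketch. It assumes, without having checked crefmatch, that the abbreviation is taken as the text after the leading dot up to the first whitespace; `abbreviation_key` is a hypothetical name, not a function from the repository.

```python
def abbreviation_key(pwbib_line: str) -> str:
    """Hypothetical: text after the leading '.' up to the first whitespace."""
    text = pwbib_line.strip().lstrip(".").strip()
    parts = text.split(None, 1)
    return parts[0] if parts else ""


# Without the space, the bracketed expansion is glued to the key.
print(abbreviation_key(".C2A7KTA7N(ANDATARAM5GIN2I)"))
# With the space, only the abbreviation itself remains.
print(abbreviation_key(".C2A7KTA7N (ANDATARAM5GIN2I)"))
```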
108 <ls>GAN2IT.</ls><ls>GRAH
-><ls>GAN2IT. GRAH
It is a single reference, not two.
See
Line 50193: <H1><h><key1>dinagaRa</key1><key2>dinagaRa</key2></h><body><gram n="m">m.</gram> = <s>AhargaRa</s> <noti>2)</noti> <ls>GAN2IT.</ls><ls>GRAH.3.</ls> PW50191</body><tail><L>50189</L><pc>3087-3</pc></tail></H1>
Line 88456: <H1><h><key1>mfdu</key1><key2>mfdu/</key2></h><body><divm type="e" n="1">1)</divm> <gram n="Adj">Adj.</gram> (<gram n="f">f.</gram> <s>mfqu/</s>) <noti>und</noti> <s>mfdvI/</s> <divm type="n" n="a">a)</divm> <i>weich , zart , geschmeidig</i> <noti>(Compar.</noti> <s>mfqutara</s>) <ls>TS.PRA7T.</ls><ls>Chr.296,14</ls>); <i>mild , sanft.</i> <noti>So heissen die Mondhäuser Anura7dha</noti> , <noti>Kitra7 , Revati und Mr2gac2iras.</noti> <divm type="n" n="b">b)</divm> <i>mild</i> , <noti>so v.a.</noti> <i>schwach , mässig.</i> <divm type="n" n="c">c)</divm> <i>schwach , keinen Widerstand zu leisten vermögend.</i> <divm type="n" n="d">d)</divm> <i>langsam</i> ; <noti>in der Astron. so v.a.</noti> <i>in der oberen Apsis stehend</i> <ls>GAN2IT.</ls><ls>GRAH.14.</ls> <divm type="e" n="2">2)</divm> <gram n="m">m.</gram> <gram n="n">n.</gram> <i>Milde.</i> <divm type="e" n="3">3)</divm> <gram n="m">m.</gram> <divm type="n" n="a">a)</divm> <i>der Planet Saturn.</i> <divm type="n" n="b">b)</divm> <not...
Line 104592: <H1><h><key1>viSiKa</key1><key2>viSiKa/</key2></h><body><noti>und</noti> <s>vi/SiKa</s> <divm type="e" n="1">1)</divm> <gram n="Adj">Adj.</gram> <divm type="n" n="a">a)</divm> <i>ohne Haarschopf</i> <ls>HEMA7DRI.2,</ls> <i>a</i> , <ls>38,19.</ls> <divm type="n" n="b">b)</divm> <i>kahl</i> ; <noti>von Pfeilen so v.a.</noti> <i>unbefiedert.</i> <divm type="n" n="c">c)</divm> <i>ohne Spitze , stumpf</i> <noti>(Pfeil).</noti> <divm type="n" n="d">d)</divm> <i>ohne Flamme</i> <noti>(Feuer).</noti> <divm type="n" n="e">e)</divm> <i>ohne Spitze</i> <noti>von einem Kometen , so v.a.</noti> <i>ohne Schweif.</i> <divm type="e" n="2">2)</divm> <gram n="m">m.</gram> <divm type="n" n="a">a)</divm> <i>ein stumpfer Pfeil</i> , <noti>überh.</noti> <divm type="n" n="b">b)</divm> <i>*Spiess , Wurfspiess.</i> <divm type="n" n="c">c)</divm> = <s>Sara</s> <i>Sinus versus</i> <ls>GAN2IT.</ls><ls>GRAHAJ.6.</ls> <divm type="e" n="3">3)</divm> <gram n="f">f.</gram> <s>viSiKA</s> <divm type="n" n="a">a)</divm> <i>*...
EJF Good catch. Don't know how you noticed this! Corrections to pw:
¯GAN2IT.¯GRAH.3.@dinagaRa@dinagaRa@50189:¯GAN2IT.GRAH.3.:t:
¯GAN2IT.¯GRAH.14.@mfdu@mfdu/@88452:¯GAN2IT.GRAH.14.:t:
¯GAN2IT.¯GRAHAJ.6.@viSiKa@viSiKa/@104588:¯GAN2IT.GRAHAJ.6.:t:
For the last one, I could have made a change to print and changed 'GRAHAJ' to 'GRAH', but chose instead to force a match via a change to clean_special in abbrv.py.
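The kind of change made for the GAN2IT. cases (two adjacent 'ls' tags that are really one reference) can be sketched as follows. This is an illustration only, not the code used for the corrections above; `merge_split_ls` is a hypothetical helper, and the first half of the reference must be named explicitly because other directly adjacent 'ls' tags are genuinely separate references.

```python
import re


def merge_split_ls(line: str, first: str) -> str:
    """Hypothetical: merge <ls>first</ls><ls>rest</ls> into one <ls> tag.

    Only directly adjacent tags are merged, and only when the first tag's
    content equals `first` exactly.
    """
    pattern = re.compile(
        r"<ls>" + re.escape(first) + r"</ls><ls>([^<]*)</ls>"
    )
    return pattern.sub(
        lambda m: "<ls>" + first + m.group(1) + "</ls>", line
    )


sample = "<noti>2)</noti> <ls>GAN2IT.</ls><ls>GRAH.3.</ls> PW50191"
print(merge_split_ls(sample, "GAN2IT."))
```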
109 KIELHORN,Rep Not found in pw.xml Only two occurrences of KIELHORN
Line 97671: <H1><h><key1>vaqQ</key1><key2>*vaqQ</key2></h><body>, <s>vaqQati</s> <ls>PAT.</ls> <noti>zu</noti> <ls>P.1,3,1,</ls> <ls>VA7RTT.12</ls> <noti>in</noti> <ls>KIELHORN'S.AUSG.</ls> PW97668</body><tail><L>97667</L><pc>6009-2</pc></tail></H1>
Line 110394: <H1><h><key1>SambawI</key1><key2>SambawI</key2></h><body><gram n="f">f.</gram> <noti>(!)</noti> <s>zoqaSa palASca mAzaSambawya</s> ; <ls>PAT.</ls> <noti>zu</noti> <ls>P.1,2,64,</ls> <ls>VA7RTT.59.</ls> <noti>Vgl.</noti> <ls>KIELHORN,</ls> <ls>MAHA7BH.Bd.2,</ls> <noti>S. 10.</noti> PW110388</body><tail><L>110390</L><pc>6208-1</pc></tail></H1>
110 PRA7JAC2K4ITTAV Not found in pw.xml PRA7JAC2K4ITTAT is also a separate entry in pwbib0.txt.
I am not sure about C2U3LAPA7N2I7.Pra7jac2k4ittaviveka. It seems that the first item is author and the second a text. Not sure how to change the tags. pwbib0.txt doesn't show this C2U3LAPA7N2I7 stuff.
Line 1892: <H1><h><key1>atibAla</key1><key2>atibAla</key2></h><body><divm type="e" n="1">1)</divm> <gram n="Adj">Adj.</gram> (<gram n="f">f.</gram> <s>A</s>) <i>überaus jung.</i> <divm type="e" n="2">2)</divm> <i>eine zweijährige Kuh</i> <ls>PRA7JAC2K4ITTAT.</ls> PW1888</body><tail><L>1888</L><pc>1022-3</pc></tail></H1>
Line 117966: <H1><h><key1>saMjYapana</key1><key2>saMjYa/pana</key2></h><body><gram n="n">n.</gram> <divm type="e" n="1">1)</divm> <i>das Einmüthigmachen.</i> <divm type="e" n="2">2)</divm> <i>das Tödten des Opferthieres</i> <noti>(durch Ersticken).</noti> <divm type="e" n="3">3)</divm> <i>das Betrügen , Anführen</i> <ls>C2U3LAPA7N2I7.Pra7jac2k4ittaviveka.168,</ls> <noti>a nach</noti> <ls>AUFRECHT.</ls> PW117960</body><tail><L>117962</L><pc>7026-1</pc></tail></H1>
EJF. I propose two changes:
¯C2U3LAPA7N2I7.Pra7jac2k4ittaviveka.168, ‹a nach› ¯AUFRECHT.@saMjYapana@saMjYa/pana@117962:
¯C2U7LAPA7N2I7, ¯PRA7JAC2K4ITTAVIVEKA.168, ‹a nach AUFRECHT>.:t:
111 GOVINA7N->GOVINDA7N govindAnanda Already corrected it seems.
112 NA7DAR.Up->NA7DAR.UP Already corrected
113 LILA7V->LI7LA7V lIlAvatI Capital I
EJF Correction to pwbib:
.LILA7V. == BHA7SKARA'S LILA7VATI => .LI7LA7V. == BHA7SKARA'S LI7LA7VATI7
114 DRAVJA->DRAVJAV Only one entry. That means that work identified as DRAVJA (UDDHITATTVA) may also not be proper. See
EJF. There is a typo in pwbib for this one:
.DRAVJA (UDDHITATTVA) => .DRAVJAC2 (UDDHITATTVA)
That still doesn't match the DRAVJAV of the text. Best to leave it unmatched for now.
Let's start another list of known PW references that are unmatched in pwbib. And add DRAVJAV as honorary first member of list.
We can also put C2U7LAPA7N2I7 in that list.
115 TAITT.Up->TAITT.UP Already corrected
116 C2OBH Not found in pw.xml
117 MAYR,Ind.Erb Not found in pw.xml. Neither MAYR nor Erb was found either.
118 ALAM5A7RA->ALAM5KA7RA
Already corrected.
But there is one more correction needed in pwbib0.txt
ALAM5KA7RA(TNA7KARA)
->ALAM5KA7RA(RATNA7KARA)
alaMkAraratnAkara
EJF. Changed pwbib0 (for consistency with print):
.ALAM5KA7RA(TNA7KARA) => .ALAM5KA7RAR(ATNA7KARA)
119 GAN2ITA,K4ANDRAGR(AHA7DHIKA7RA)->GAN2ITA,K4ANDRAGR Bracket added by pwbib0. There are two references, one with candragr and one with candragrah. Both should be merged.
EJF.
. GAN2ITA,K4ANDRAGR(AHA7DHIKA7RA) => .GAN2ITA,K4ANDRAGR (AHA7DHIKA7RA)
¯G4AN2ITA.K4ANDRAGR.3.@mah@ma/h@84046:¯GAN2ITA,K4ANDRAGR.3.:t:
¯GAN2IT.26,7.¯K4ANDRAGRAH.24,35)@suKArTa@suKArTa@125905:¯GAN2ITA,K4ANDRAGRAH.26,7.24,35 ):p: For consistency with Bibliography
119 NI7LAK.miteinerZahl->NI7LAK.
NI7LAK. mit einer Zahl == NI7L. (vol. 1)
It means that NI7LAK.[0-9] refers to NI7L., which itself is:
NI7L. == A rational Refutation of the Hindu Phisosphical Systems, by NEHEMIAH NILAKAN2T2HA SA'STRI GORE. Translated etc. by RITZ-EDWARD HALL. Calcutta 1862. (vol. 1)
Only one occurrence.
<H1><h><key1>asaMnikfzwa</key1><key2>asaMnikfzwa</key2></h><body><gram n="Adj">Adj.</gram> <i>nicht in unmittelbarer Nähe befindlich</i> <ls>NI7LAK.172.</ls> PW12376</body><tail><L>12376</L><pc>1146-3</pc></tail></H1>
EJF. I think we should add NI7LAK.miteinerZahl to the pwbib unused list. The reference in asaMnikfzwa is more easily understood as matching the pwbib entry:
.NI7LAK. == NI7LAKAN2T2HA, Commentator des MBH.
(See further separate comment below)
120 MAHA7BH.(K.)
Must have been corrected in the new pw.xml
Earlier it had misclosed brackets
<ls>MAHA7BH.(K.</ls>)
@funderburkjim may check it up.
121 WIND. SANG.->WIND. SANC. in pwbib0.txt See
122 SAHR2DAJA7LOKA Not found in pw.xml
123 KA7VJA7L
Not found in pw.xml.
It is always KA7VJA7D.
I tried the regex KA7VJA7[^D] and it yielded 0 matches.
EJF. Odd situation. There are two KA7VJA7L items in pwbib0.
Since neither has a match in pw, I'm adding both to the pwbib unused list.
On the other hand, there are 28 matches in pw to KA7VJA7D, but since there appears to be nothing in pwbib0 to match to, I'm adding KA7VJA7D to the list of known cref abbreviations with no pwbib correspondent.
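The regex checks used under this item and under NI7LAK.[0-9] can be reproduced with a short sketch like the one below. It assumes pw.xml is read as a list of lines and that only the content of 'ls' tags should be searched; `find_ls` is a hypothetical helper, not part of abbrv.py or crefmatch.py.

```python
import re

LS_TAG = re.compile(r"<ls>(.*?)</ls>")


def find_ls(lines, pattern):
    """Hypothetical: (line number, ls content) pairs matching `pattern`."""
    rx = re.compile(pattern)
    hits = []
    for number, line in enumerate(lines, start=1):
        for ref in LS_TAG.findall(line):
            if rx.search(ref):
                hits.append((number, ref))
    return hits


# One line taken from the asaMnikfzwa entry quoted in this issue.
lines = ["<i>nicht in unmittelbarer Nähe befindlich</i> <ls>NI7LAK.172.</ls>"]
print(find_ls(lines, r"NI7LAK\.[0-9]"))
print(find_ls(lines, r"KA7VJA7[^D]"))
```

Run against the full pw.xml, the length of the returned list gives the occurrence counts quoted in the comments.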
124 DHJA7NAB.Up->DHJA7NAB.UP Already corrected
125 TEG4OB.Up->TEG4OB.UP Already corrected
126 GAN2ITA,SU7RJAGR(AHAN2A7DHIKA7RA)->GAN2ITA,SU7RJAGR Bracket added by pwbib0.txt
<H1><h><key1>bARa</key1><key2>bARa/</key2><hom>1</hom></h><body><noti>und</noti> <s>bA/Ra</s> <divm type="e" n="1">1)</divm> <gram n="m">m.</gram> <divm type="n" n="a">a)</divm> <i>Rohrpfeil , Pfeil.</i> <divm type="n" n="b">b)</divm> <noti>Bez.</noti> <i>der Zahl fünf.</i> <divm type="n" n="c">c)</divm> <i>®Sinus_versus</i> <ls>GAN2ITA,SU7RJAGR.14.</ls> <divm type="n" n="d">d)</divm> <i>Ziel.</i> <divm type="n" n="e">e)</divm> <i>*ein best. Theil des Pfeils.</i> <divm type="n" n="f">f)</divm> <i>®Saccharum_Sara <noti>oder</noti> eine verwandte Rohrart</i> <ls>RA7G4AN.8,82.</ls><ls>BHA7VAPR.1,209.</ls> <divm type="n" n="g">g)</divm> <i>*Kuheuter.</i> <divm type="n" n="h">h)</divm> <noti>*</noti> = <s>kevala</s>. <divm type="n" n="i">i)</divm> <noti>N.pr.</noti> <divm type="g" n="a">a)</divm> <noti>eines Asura , Feindes des Vishn2u und Günstlings des C2iva.</noti> <divm type="g" n="b">b)</divm> <noti>eines Wesens im Gefolge Skanda's.</noti> <divm type="g" n="g">g)</divm> <noti>zweier Fürsten , eines Autors</noti> (<ls>KA7D.4,14</ls>) <noti>und anderer Männer.</noti> <divm type="e" n="2">2)</divm> <gram n="m">m.</gram> (*<gram n="f">f.</gram> <s>A</s>) <i>eine blau blühende Barleria</i> <ls>RA7G4AN.10,137.140.</ls><ls>BHA7VAPR.3,98.</ls> <divm type="e" n="3">3)</divm> *<gram n="f">f.</gram> <s>A</s> <i>das hintere Ende eines Pfeils.</i> <divm type="e" n="4">4)</divm> <gram n="n">n.</gram> <divm type="n" n="a">a)</divm> <i>die Blüthe von</i> <noti>2).</noti> <divm type="n" n="b">b)</divm> <i>Körper.</i> PW76611</body><tail><L>76609</L><pc>4219-3</pc></tail></H1>
@funderburkjim This ends correction submission for bibminuscref.xml. Let us know the statistics after all the corrections are installed.
I know this has been a very confusing submission on my part. Many times pw.xml, pwbib0.txt and pwbib1.txt all needed correction, and everything has been submitted intermixed here. If anything is not clear, I apologize. But from what I read in the earlier installation, both of us seem to be on the same wavelength. You guess correctly where I want the correction to be made. Best of luck with the three parts yet to be done.
NI7LAK.miteinerZahl = NI7LAK mit einer Zahl = NI7LAK with a number.
Re NI7LAK.miteinerZahl
After reading @gasyoun's comment and rereading @drdhaval2785's comment, I doubt what I said under this item.
Should we instead make a print change correction to <ls>NI7L.172.</ls>?
I found only one other NI7L. abbreviation in pw, NI7L.35.@anAdi@anAdi@3871.
Will go ahead with installation of changes per my interspersed comments. If we decide differently on NI7LAK, can add on the change.
Corrections now installed. And PWK updated with new pw.xml, and new pwbib0, etc.
There are now 460 pwbib records, after removing duplicates and those identified as unused (see below).
Of these, 445 are found to match instances in pw 'ls' records (97%).
There are only 15 cases remaining in bibminuscref.txt. @drdhaval2785 Maybe take another look? Perhaps I have missed some installation details on these.
87% of the 73256 items of abbrvlist.txt are accounted for. Still about 10,000 unaccounted for. I think @drdhaval2785 has already started attacking these.
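As a quick check of the figures above, the arithmetic can be sketched as follows. The function names are made up for illustration; the numbers are the ones quoted in this comment.

```python
def match_rate(matched: int, total: int) -> float:
    """Percentage of records that found a match."""
    return 100.0 * matched / total


def unaccounted(total: int, percent_accounted: int) -> int:
    """Items left over when `percent_accounted` percent are accounted for."""
    return total - round(total * percent_accounted / 100)


print(round(match_rate(445, 460)))   # pwbib records matched in pw 'ls' records
print(unaccounted(73256, 87))        # abbrvlist.txt items still open
```

Since 87% is itself a rounded figure, the second result is only an estimate of the "about 10,000" remaining items.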
PWK repository now updated.
The pw sync file is updated.
Here is the current list of pwbib abbreviations that are unused:
21 MAHA7B
22 C2RIMA7LA7M
26 Bydragen
28 HARISV
29 gan2a
33 SVAPNAK4(INTA7MAN2I)
14 LEUMANNA,Aup.Gl
PWK#20
38 VA7RA7HAP
41 PRAG4A7PATI
43 MAITR.PADDH
51 KHAN2D2APR
59 ALAM5KA7RAS
PWK#21
65 BÜHLER,Rep.1872-73
66 DEVI7BHA7G
76 DEVATA7DHJ.BRA7HM
81 Ind.Str.
90 GAN2ITA,MADHJA7M(A7DHJA7JA)
90 GAN2ITA,K4ANDRAGR(AHA7DHIKA7RA)
109 KIELHORN,Rep
PWK#22
91 KA7TJ.C2RA7DDHAK
98 VA7MANAP
116 C2OBH
119 NI7LAK.miteinerZahl
122 SAHR2DAJA7LOKA
123 KA7VJA7L
... K4HA7NDOGJAP Added by ejf.
NI7LAK.miteinerZahl
Here is the start of a list to contain literary source abbreviations that we think are missing from pwbib.
DRAVJAV
C2U7LAPA7N2I7
KA7VJA7D
At some point, I should probably modify a program (either abbrv.py or crefmatch.py, not sure which) to use this list, so that we won't have to rethink these members of crefminusbib.txt.
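A minimal sketch of how such a list might be used is given below. It assumes the list is held as a set of abbreviation strings and that crefminusbib is a sequence of abbreviations; `unresolved` is a hypothetical helper, and where it would live (abbrv.py or crefmatch.py) is left open, as in the comment above.

```python
# Abbreviations believed to be missing from pwbib (the list started above).
KNOWN_MISSING = {"DRAVJAV", "C2U7LAPA7N2I7", "KA7VJA7D"}


def unresolved(crefs_minus_bib, known_missing=KNOWN_MISSING):
    """Hypothetical: keep only crefs that are not already known to be missing."""
    return [cref for cref in crefs_minus_bib if cref not in known_missing]


# Sample input for illustration only; not the real crefminusbib.txt content.
print(unresolved(["KA7VJA7D", "HEMA7DRI", "DRAVJAV"]))
```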
With the possible exception of the NI7LAK case, this issue can be closed.
There are now 460 pwbib records, after removing duplicates and those identified as unused (see below).
If the unused abbreviations are in the Prefaces of PWK, they should still be mentioned, maybe with an additional tag. I would like to make a list of reference works used for my Reverse dictionary. Knowing which ones are unused is interesting for real-life applications, but they should still remain in the original list, I guess. @funderburkjim agree?
Waiting for @drdhaval2785
@gasyoun Yes. Agree that the Unused (and duplicates) should remain in pwbib. Probably a good idea also to annotate a final printed version of pwbib regarding the unused and duplicates that we identify.
annotate a final printed version of pwbib regarding the unused and duplicates that we identify
Absolutely the way to go.
@funderburkjim I guess we can close this issue and you can install it. Maybe we can also add NI7LAK.miteinerZahl to the pwbib unused list. Close the issue. I agree with your observations.
And we should also start maintaining a 'doubtful.txt' file for such issues where we are not sure and have still made some decision or deferred some decision because of doubt. Some expert somewhere in the world may help. कालो ह्ययं निरवधिर्विपुला च पृथ्वी । (For time is boundless and the earth is vast.)
Added pwbib_unused.txt.
91 KA7TJ.C2RA7DDHAK not found All are GOBH.C2RA7DDHA