sanskrit-lexicon / PWK

Sanskrit-Wörterbuch in kürzerer Fassung, 7 Bände Petersburg 1879-1889

crefminusbib, part 2 #26

Closed: drdhaval2785 closed 8 years ago

drdhaval2785 commented 8 years ago

Continued from https://github.com/sanskrit-lexicon/PWK/issues/23: items 11 to 15

¯A7RAJABH@krAntiBujA@krAntiBujA@31818:¯A7RJABH:t:Three occurrences of this error.
¯MARA7VIRAK@sarvAkAra@sarvAkAra°@122227:¯MAHA7VIRAK4:t:Two errors H and 4.
¯R.V@pramaMhizWIya@pramaMhizWIya@72683:¯RV:t:No period intervening.
¯GAUT.Jolly., ‹Schuld@kArita@kArita@26850:¯GAUT. ¯Jolly., ‹Schuld:t:Jolly, Schuld is counted as separate entry in pwbib0.txt and also in other entries of pw.txt
¯Vikrama7n4kak4@KonamuKa@KonamuKa@33934:¯VIKRAMA7N5KAK4:t:n4->n5 and capitals
drdhaval2785 commented 8 years ago

16 to 20

¯BHU7G.P@tarza@tarza@44976:¯BHA7G.P:t:BAgavatapurARa
¯MBH.AM ‹Ende eines Comp.›@rAtra@rAtra@93727:¯MBH. ‹Am Ende eines Comp.›:t:wrong inclusion of AM in the tag.
¯K4A ƒPage4.247-1ƒ ¯RAKA6,20.@BaYj@BaYj@78555:¯K4ARAKA  ƒPage4.247-1ƒ 6,20.:t:I am proposing a shift in the page break from in between a reference to after a reference.
¯C2at.br.@akzI@akzI@373:¯C2AT.BR.:t:capitalize
¯C2I7LA7N5RA@suGarikAgfhaka@suGarikAgfhaka@126063:¯C2I7LA7N5RA:n:Not sure. N5RA is grammatically odd. But this is a new resource. Noted there.
drdhaval2785 commented 8 years ago

21 to 25

¯SA7GA7N@raktacitraka@*raktacitraka@91669:¯RA7G4AN:t:Two errors. S and no 4.
¯HENA7DRI@SUlin@SUlin@113938:¯HEMA7DRI:t:
¯KARKA@mUza@mUza@88058:¯KARAKA:t:
¯VRN2I3S@nfpaSu@nfpaSu@60920:¯VEN2I7S:t:two errors R and 3
¯GA@kapiSa@kapiSa@24295:¯GAL:t:
drdhaval2785 commented 8 years ago

26 to 30

¯MU7LLER,@vArARasIdarpaRa@vArARasIdarpaRa@100901:¯MÜLLER,:t:
¯HANV@dUramUla@*dUramUla@51823:¯DHANV:t:
¯TAYN2D2JA-BR@pipIlikamaDya@pipIlikamaDya@67110:¯TAN2D2JA-BR:t:
¯G4AGATI7@jagat@ja/gat@41229:G4agati:t:Not a reference.
¯MAT.med.@azwarasa@azwarasa@12075:¯MAT.MED.:t:May have been corrected earlier.
drdhaval2785 commented 8 years ago

31 to 33

¯C2A7C2VATA205@saptaBUma@saptaBUma@119472:¯C2A7C2VATA 205:t:Spacing issue. Only ¯C2A7C2VATA is reference. Rest is number.
¯C2AM5K.Z@turI@turI@46367:¯C2AM5K.:t:
¯C2AM5J@samyaNnati@samyaNnati@121322:¯C2AM5K:t:
drdhaval2785 commented 8 years ago

34

¯BENFEY@jaJJa@jaJJa@41375:¯BENFEY:n:

This requires investigation. @gasyoun may have a look at the scan and try to make out what it means.

The entry is: [image: capture]

pwbib0.txt has only the following entry:
+.BENF. Chr. == BENFEY'S Chrestomathie. (vol. 1)

In pw.txt:
6 occurrences of only BENFEY
2 occurrences of only BENF
1 occurrence of BENF.Chr
1 occurrence of BENFEY.CHR
1 occurrence of BENFEY.Chr

I think they all refer to the same item. Not sure.

drdhaval2785 commented 8 years ago

35

¯PRANANNAR@pataMga@pataMga/@62458:¯PRASANNAR:t:
drdhaval2785 commented 8 years ago

36

¯Sa7h.D@prARatva@prARatva/@74093:¯SA7H.D:t:Capitalization error
drdhaval2785 commented 8 years ago

37

¯D.P: Not sure, but it seems that P. refers to pANini. [image: capture]

drdhaval2785 commented 8 years ago

38 to 40

¯C2I1LA7N5KA1,257@viSarAru@viSarAru@104529:¯C2ILA7N5KA 1,257:t:
¯MAH@nirveSya@nirveSya@59691:¯MBH:t:
¯SIPARN@cakzurmuKa@ca/kzurmuKa@38370:¯SUPARN:t:
drdhaval2785 commented 8 years ago

@funderburkjim You may install the corrections from both the part 1 and part 2 issues using the following submission in standard format:

¯AGN.¯P.@SAlagrAma@SAlagrAma@111783:¯AGNI.P.:t:Not two separate references.
¯VIN2I7S@svEra@svEra@133244:¯VEN2I7S:t:
¯HEM.¯PAR.@cItkfta@cItkfta@40339@231:¯HEM.PAR.:t:one reference
¯BAHT2T2@sah@sah@122811:¯BHAT2T2:t:
¯KURU'S@kurupARqava@kurupARqava@29046:KURU'S:Not a reference. Only proper name.
¯DAMAJANTIK@raNku@raNku@91976:¯DAMAJANTI7K:t:
¯VARA7H.BR2H.S5,27@avanISa@avanISa@10444:¯VARA7H.BR2H.S.5,27:t:Missing period after S
¯MAUIDH@lopya@lo/pya@96775:¯MAHI7DH:t:Unclear print
¯UTPAL ‹zu› ¯VARA7H.BR2H.@rakta@rakta@91633:¯UTPALA ‹zu› ¯VARA7H.BR2H.:t:Also check whether this UTPALA is a literary resource or not ?
¯C2A7N5KH.A7r@saMSlezaRa@saMSlezaRa@116900:¯C2A7N5KH.A7R:t:
¯A7RAJABH@krAntiBujA@krAntiBujA@31818:¯A7RJABH:t:Three occurrences of this error.
¯MARA7VIRAK@sarvAkAra@sarvAkAra°@122227:¯MAHA7VIRAK4:t:Two errors H and 4.
¯R.V@pramaMhizWIya@pramaMhizWIya@72683:¯RV:t:No period intervening.
¯GAUT.Jolly., ‹Schuld@kArita@kArita@26850:¯GAUT. ¯Jolly., ‹Schuld:t:Jolly, Schuld is counted as separate entry in pwbib0.txt and also in other entries of pw.txt
¯Vikrama7n4kak4@KonamuKa@KonamuKa@33934:¯VIKRAMA7N5KAK4:t:n4->n5 and capitals
¯BHU7G.P@tarza@tarza@44976:¯BHA7G.P:t:BAgavatapurARa
¯MBH.AM ‹Ende eines Comp.›@rAtra@rAtra@93727:¯MBH. ‹Am Ende eines Comp.›:t:wrong inclusion of AM in the tag.
¯K4A ƒPage4.247-1ƒ ¯RAKA6,20.@BaYj@BaYj@78555:¯K4ARAKA  ƒPage4.247-1ƒ 6,20.:t:I am proposing a shift in the page break from in between a reference to after a reference.
¯C2at.br.@akzI@akzI@373:¯C2AT.BR.:t:capitalize
¯C2I7LA7N5RA@suGarikAgfhaka@suGarikAgfhaka@126063:¯C2I7LA7N5RA:n:Not sure. N5RA is grammatically odd. But this is a new resource. Noted there.
¯SA7GA7N@raktacitraka@*raktacitraka@91669:¯RA7G4AN:t:Two errors. S and no 4.
¯HENA7DRI@SUlin@SUlin@113938:¯HEMA7DRI:t:
¯KARKA@mUza@mUza@88058:¯KARAKA:t:
¯VRN2I3S@nfpaSu@nfpaSu@60920:¯VEN2I7S:t:two errors R and 3
¯GA@kapiSa@kapiSa@24295:¯GAL:t:
¯MU7LLER,@vArARasIdarpaRa@vArARasIdarpaRa@100901:¯MÜLLER,:t:
¯HANV@dUramUla@*dUramUla@51823:¯DHANV:t:
¯TAYN2D2JA-BR@pipIlikamaDya@pipIlikamaDya@67110:¯TAN2D2JA-BR:t:
¯G4AGATI7@jagat@ja/gat@41229:G4agati:t:Not a reference.
¯MAT.med.@azwarasa@azwarasa@12075:¯MAT.MED.:t:May have been corrected earlier.
¯C2A7C2VATA205@saptaBUma@saptaBUma@119472:¯C2A7C2VATA 205:t:Spacing issue. Only ¯C2A7C2VATA is reference. Rest is number.
¯C2AM5K.Z@turI@turI@46367:¯C2AM5K.:t:
¯C2AM5J@samyaNnati@samyaNnati@121322:¯C2AM5K:t:
¯BENFEY@jaJJa@jaJJa@41375:¯BENFEY:n:Not sure
¯PRANANNAR@pataMga@pataMga/@62458:¯PRASANNAR:t:
¯Sa7h.D@prARatva@prARatva/@74093:¯SA7H.D:t:Capitalization error
¯D.P@prAggrAmam@*prAggrAmam@73897:¯D.P:n:Not sure.
¯C2I1LA7N5KA1,257@viSarAru@viSarAru@104529:¯C2ILA7N5KA 1,257:t:
¯MAH@nirveSya@nirveSya@59691:¯MBH:t:
¯SIPARN@cakzurmuKa@ca/kzurmuKa@38370:¯SUPARN:t:
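
A minimal sketch of how one line of the submission above might be parsed. The field names (`old`, `key1`, `key2`, `lnum`, `new`, `status`, `comment`) and their meanings are assumptions read off the examples, not names taken from the PWK programs, and `parse_correction` is a hypothetical helper.

```python
from typing import NamedTuple


class Correction(NamedTuple):
    old: str      # text as it currently stands in pw.txt
    key1: str     # headword (assumed meaning of the 2nd @-field)
    key2: str     # headword with accent or * marks (assumed)
    lnum: int     # numeric 4th @-field (assumed to be a record number)
    new: str      # proposed replacement text
    status: str   # 't' or 'n' in the examples above
    comment: str  # free text, may be empty


def parse_correction(line: str) -> Correction:
    """Parse one 'old@key1@key2@lnum:new:status:comment' line."""
    # Split on the first three colons only, so the comment may contain colons.
    parts = line.rstrip("\n").split(":", 3)
    if len(parts) != 4:
        raise ValueError(f"expected 4 colon-separated fields: {line!r}")
    head, new, status, comment = parts
    fields = head.split("@")
    if len(fields) != 4:
        raise ValueError(f"expected 4 @-separated fields: {line!r}")
    old, key1, key2, lnum = fields
    return Correction(old, key1, key2, int(lnum), new, status, comment)
```

Under these assumptions, lines that deviate from the pattern (for example the ¯KURU'S line, which has no status field, or the ¯HEM.¯PAR. line, which has an extra @-field) raise `ValueError` and would need to be handled by hand.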
gasyoun commented 8 years ago

@drdhaval2785 D.P - if P[anini], who is D? Does not look like Dhatupatha.

Two different BENFEY works:

1 occurrence of BENF.Chr
1 occurrence of BENFEY.CHR
1 occurrence of BENFEY.Chr

Chrestomathie aus Sanskritwerken: zum Gebrauch für Vorlesungen und zum Selbststudium; 1
Author / editor: Benfey, Theodor | Place of publication: Leipzig | Year of publication: 1853 | Publisher: Brockhaus

https://books.google.com.au/books?id=iGMVAAAAQAAJ&printsec=frontcover#v=onepage&q&f=false
http://reader.digitale-sammlungen.de/de/fs1/object/display/bsb10522341_00005.html

Gött. Nach. = Gött[ingen] Nach[richten]

Nachrichten von der Königl. Gesellschaft der Wissenschaften und der Georg-Augusts-Universität zu Göttingen (1877), volume 1877, pages 66-72

https://eudml.org/doc/179787
http://gdz.sub.uni-goettingen.de/dms/load/img/?PPN=GDZPPN002518171&IDDOC=54126

funderburkjim commented 8 years ago

Re ¯C2I1LA7N5KA1,257@viSarAru@viSarAru@104529:¯C2ILA7N5KA 1,257:t:

In pw.txt, how to specify the 'scope' of the literary source reference is problematic. The current scheme says that the literary source reference starts with the macron character ¯ and continues to the character before the next space character. Sometimes, Thomas used an ellipsis character as a 'soft space' character.

Because of this 'scope' issue, there should not be a space between the A and the 1. Also, the spelling of this particular abbreviation starts with C2I7L. The final correction would be ¯C2I7LA7N5KA.1,257
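
The scope rule described above can be sketched in a few lines. This is only an illustration of the rule as stated (macron up to the character before the next space); `reference_scopes` is a hypothetical name, and the ellipsis 'soft space' is not modeled.

```python
import re
from typing import List

# Assumed reading of the scheme: a reference starts at the macron (U+00AF)
# and runs up to, but not including, the next space character.
REFERENCE = re.compile(r"¯[^ ]*")


def reference_scopes(text: str) -> List[str]:
    """Return every literary-source reference span found in text."""
    return REFERENCE.findall(text)


# With a space, the verse numbers fall outside the reference scope.
print(reference_scopes("¯C2ILA7N5KA 1,257"))
# With a period instead of the space, they stay inside it.
print(reference_scopes("¯C2I7LA7N5KA.1,257"))
```

This is why the correction uses a period rather than a space before 1,257: under this rule a space would cut the numbers off from the reference.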

funderburkjim commented 8 years ago

Re ¯K4A ƒPage4.247-1ƒ ¯RAKA6,20.@BaYj@BaYj@78555:¯K4ARAKA ƒPage4.247-1ƒ 6,20.:t: I am proposing a shift in the page break from in between a reference to after a reference.

This was previously changed to ƒPage4.247-1ƒ ¯K4ARAKA.6,20.

funderburkjim commented 8 years ago

Re ¯MAH@nirveSya@nirveSya@59691:¯MBH:t:
With a period (¯MAH.), there are 3 cases. Without a period, there are 438 cases, but these also match things like ¯MAHI7DH, ¯MAHA7BH.Einl., etc., which is not intended. So the period is needed; otherwise many new errors would be introduced.

A similar issue arises with ¯GA@kapiSa@kapiSa@24295:¯GAL:t:. As written, it makes 2938 changes. With the (needed) period, 1 change.
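
A small illustration of the over-matching problem, using a made-up sample string rather than real pw.txt data. Plain substring replacement is assumed here; the actual PWK programs may match differently.

```python
# Made-up sample: two longer abbreviations that merely begin with "¯MAH",
# plus one genuine "¯MAH." typo.
sample = "¯MAHI7DH. ¯MAHA7BH.Einl. ¯MAH.1,2"

# Without the period, the pattern is a prefix of unrelated abbreviations.
loose_count = sample.count("¯MAH")
loose_result = sample.replace("¯MAH", "¯MBH")

# With the period, only the intended abbreviation matches.
strict_count = sample.count("¯MAH.")
strict_result = sample.replace("¯MAH.", "¯MBH.")

print(loose_count, loose_result)
print(strict_count, strict_result)
```

The loose pattern rewrites ¯MAHI7DH and ¯MAHA7BH as well, which is the kind of new error the period avoids.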

funderburkjim commented 8 years ago

Re ¯C2I7LA7N5RA@suGarikAgfhaka@suGarikAgfhaka@126063:¯C2I7LA7N5RA:n:Not sure. N5RA is grammatically odd. But this is a new resource. Noted there.

Per discussion in #24, the correction is ¯C2I7LA7N5RA@¯C2I7LA7N5KA@t@

funderburkjim commented 8 years ago

Re ¯MAT.med.@¯MAT.MED.

We previously changed the others to 'Mat.med', which is consistent with pwbib0. Will make that same change in the two ¯MAT.med. cases.

funderburkjim commented 8 years ago

Have now installed corrections for #23-#26. Here's the list. I think this is everything. First, these are done one at a time:

¯AGN.¯P.@SAlagrAma@SAlagrAma@111783:¯AGNI.P.:t:Not two separate references.
¯VIN2I7S@svEra@svEra@133244:¯VEN2I7S:t:
¯BAHT2T2@sah@sah@122811:¯BHAT2T2:t:
¯KURU'S ‹(d.@kurupARqava@kurupARqava@29046:‹KURU'S (d.:t:Not a reference. Only proper name.
¯VARA7H.BR2H.S5,27@avanISa@avanISa@10444:¯VARA7H.BR2H.S.5,27:t:Missing period after S
¯MAUIDH@lopya@lo/pya@96775:¯MAHI7DH:t:Unclear print
¯UTPAL ‹zu› ¯VARA7H.BR2H.@rakta@rakta@91633:¯UTPALA ‹zu› ¯VARA7H.BR2H.:t: UTPALA name of astronomer?
¯C2A7N5KH.A7r@saMSlezaRa@saMSlezaRa@116900:¯C2A7N5KH.A7R:t:case
¯SIDDH.K. ‹ed.› ¯TA7R.1,371.@tasTI@@45086:¯SIDDH.K.ed.TA7R.1,371.:t:

Second, these from #23 are done by finding all cases. Here's a summary of the number of cases found:

120 changes for  ¯HEM.¯PAR. => ¯HEM.PAR.
21 changes for  ¯HEM. ¯PAR. => ¯HEM.PAR.
6 changes for  ¯DAMAJANTIK. => ¯DAMAJANTI7K.
10 changes for  ¯C2ILA7N5KA => ¯C2I7LA7N5KA
1 changes for  ¯C2I7LA7N5RA => ¯C2I7LA7N5KA

Finally, here is a summary of the ones from Dhaval's "standard form" above:

4 changes for  ¯A7RAJABH => ¯A7RJABH
1 changes for  ¯MARA7VIRAK => ¯MAHA7VIRAK4
1 changes for  ¯R.V => ¯RV
1 changes for  ¯GAUT.Jolly., ‹Schuld => ¯GAUT. ¯Jolly., ‹Schuld
1 changes for  ¯Vikrama7n4kak4 => ¯VIKRAMA7N5KAK4
1 changes for  ¯BHU7G.P => ¯BHA7G.P
1 changes for  ¯MBH.AM ‹Ende eines Comp.› => ¯MBH. ‹Am Ende eines Comp.›
13 changes for  ¯C2at.br. => ¯C2AT.BR.
1 changes for  ¯SA7GA7N => ¯RA7G4AN
1 changes for  ¯HENA7DRI => ¯HEMA7DRI
2 changes for  ¯KARKA => ¯KARAKA
1 changes for  ¯VRN2I3S => ¯VEN2I7S
1 changes for  ¯GA. => ¯GAL.
3 changes for  ¯MU7LLER, => ¯MÜLLER,
1 changes for  ¯HANV => ¯DHANV
2 changes for  ¯TAYN2D2JA-BR => ¯TAN2D2JA-BR
2 changes for  ¯G4AGATI7 => G4agati
2 changes for  ¯MAT.med. => ¯Mat.med.
1 changes for  ¯C2A7C2VATA205 => ¯C2A7C2VATA.205
1 changes for  ¯C2AM5K.Z => ¯C2AM5K.
1 changes for  ¯C2AM5J => ¯C2AM5K
1 changes for  ¯PRANANNAR => ¯PRASANNAR
1 changes for  ¯Sa7h.D => ¯SA7H.D
1 changes for  ¯C2I1LA7N5KA1,257 => ¯C2ILA7N5KA.1,257
3 changes for  ¯MAH. => ¯MBH.
1 changes for  ¯SIPARN => ¯SUPARN
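
For illustration, a summary in the shape shown above could be produced by a sketch like the following. `apply_changes` is a hypothetical helper, not the PWK program itself, and it assumes that a 'change' means one substring occurrence (the real count may be per line).

```python
from typing import Iterable, List, Tuple


def apply_changes(lines: Iterable[str],
                  changes: List[Tuple[str, str]]) -> Tuple[List[str], List[str]]:
    """Apply (old, new) replacements in order; return new lines and a report."""
    out = list(lines)
    report = []
    for old, new in changes:
        # Count occurrences before replacing, so the report reflects this step.
        n = sum(line.count(old) for line in out)
        out = [line.replace(old, new) for line in out]
        report.append(f"{n} changes for  {old} => {new}")
    return out, report


# Made-up input lines, not taken from pw.txt.
new_lines, report = apply_changes(
    ["x ¯SIPARN. y", "¯HANV. ¯HANV."],
    [("¯SIPARN", "¯SUPARN"), ("¯HANV", "¯DHANV")],
)
print("\n".join(report))
```

Because the pairs are applied in order, a later pair sees the text as already modified by earlier ones, so the ordering of the list matters.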
funderburkjim commented 8 years ago

PWK programs rerun.

No change to pwbib cases. Still 15 remain in bibminuscref.

Some progress in abbrvlist matching.

Previously (#22) 63902 out of 73256 (87.2%)

Now, 64092 out of 73111 cases (87.6%)

funderburkjim commented 8 years ago

I think #23 and #26 can be closed now, but will let Dhaval decide.

gasyoun commented 8 years ago

¯MAHA7BH.Einl.

Einl. = Einleitung = Introduction