sanskrit-lexicon / PWK

Sanskrit-Wörterbuch in kürzerer Fassung, 7 Bände Petersburg 1879-1889

crefminusbib, Part 7 #41

Closed funderburkjim closed 8 years ago

funderburkjim commented 8 years ago

A program (numfuzzy.py) generated changes to some literary source abbreviations in PW. The criteria were:

In this case, the program assumed that the pwbib abbreviation was correct, and generated a correction of the pw abbreviation to agree with the pwbib abbreviation.

About 20% of the program-generated changes needed some manual editing.

In total, about 500 pw changes were ultimately generated.
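
For readers who want a feel for this kind of matching, here is a minimal sketch. It is not the actual numfuzzy.py: it assumes, from the program name and the pattern of the changes, that two abbreviations are treated as a fuzzy match when they agree after the digits (which encode diacritics in this transliteration) are removed. The names `strip_digits`, `build_index` and `propose_correction` are hypothetical.

```python
import re
from collections import defaultdict


def strip_digits(abbrev):
    """Remove the digits that encode diacritics, e.g. 'NI7LAK.' -> 'NILAK.'."""
    return re.sub(r"\d", "", abbrev)


def build_index(pwbib_abbrevs):
    """Map each digit-free form to the set of pwbib abbreviations that produce it."""
    index = defaultdict(set)
    for abbrev in pwbib_abbrevs:
        index[strip_digits(abbrev)].add(abbrev)
    return index


def propose_correction(pw_abbrev, index, known):
    """Return the pwbib spelling to use for pw_abbrev, or None.

    Assumption: pwbib is taken as correct, and a correction is proposed only
    when exactly one pwbib abbreviation matches after digits are ignored.
    """
    if pw_abbrev in known:
        return None  # already agrees with pwbib
    candidates = index.get(strip_digits(pw_abbrev), set())
    if len(candidates) == 1:
        return next(iter(candidates))
    return None  # no match, or ambiguous: leave for manual review


# Tiny illustrative pwbib list (not the real file).
pwbib = ["NI7LAK.", "HARSHAK4.", "KA7TJ."]
index = build_index(pwbib)
known = set(pwbib)

for pw in ["NILAK.", "NI3LAK.", "HARSHAK.", "KA7T2J.", "NI7LAK.", "XYZ."]:
    print(pw, "=>", propose_correction(pw, index, known))
```

Ambiguous cases return None in this sketch, which is one plausible reason a share of the generated changes would still need hand editing.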

funderburkjim commented 8 years ago

Here is a summary of the changes made:

5 changes for  ¯SUPAR2N2. => ¯SUPARN2.
1 changes for  ¯KAN2D2AK. => ¯K4AN2D2AK.
1 changes for  ¯A7RJA7V. => ¯A7RJAV.
1 changes for  ¯K4A7D. => ¯KA7D.
1 changes for  ¯M4UDRA7R. => ¯MUDRA7R.
1 changes for  ¯C2A7C2VATA568. => ¯C2A7C2VATA.568.
1 changes for  ¯GA7L. => ¯GAL.
10 changes for  ¯SUPAR2N. => ¯SUPARN2.
1 changes for  ¯ATMOPAN. => ¯A7TMOPAN.
7 changes for  ¯GIT. => ¯GI7T.
5 changes for  ¯NI1LAK. => ¯NI7LAK.
3 changes for  ¯C2A7N4KH.BR. => ¯C2A7N5KH.BR.
1 changes for  ¯G4AN2IT.BHAGAN2. => ¯GAN2IT.BHAGAN2.
1 changes for  ¯C2A7C2VATA774. => ¯C2A7C2VATA.774
7 changes for  ¯KATJ. => ¯KA7TJ.
4 changes for  ¯RASENDRAK. => ¯RASENDRAK4.
1 changes for  ¯C2A7C2VATA513. => ¯C2A7C2VATA.513.
2 changes for  ¯KAMPAKA37. => ¯K4AMPAKA.37.
2 changes for  ¯BHAG.P. => ¯BHA7G.P.
1 changes for  ¯K4AT2H. => ¯KA7T2H.
2 changes for  ¯C2A7C2VATA197. => ¯C2A7C2VATA.197.
1 changes for  ¯TAITT.AR. => ¯TAITT.A7R.
4 changes for  ¯VISHNUS. => ¯VISHN2US.
2 changes for  ¯AV.PARIC. => ¯AV.PARIC2.
1 changes for  ¯GAN2IT.BHAGAN. => ¯GAN2IT.BHAGAN2.
1 changes for  ¯SUPARN. => ¯SUPARN2.
2 changes for  ¯C2I7LAN5KA. => ¯C2I7LA7N5KA.
1 changes for  ¯CA7M5K. => ¯C2AM5K.
2 changes for  ¯TS4. => ¯TS.
1 changes for  ¯KATJ.C3R. => ¯KA7TJ.C2R.
1 changes for  ¯RASENDRAK4120. => ¯RASENDRAK4.120.
9 changes for  ¯GA7TAKAM. => ¯G4A7TAKAM.
1 changes for  ¯C2A7C2VATA438. => ¯C2A7C2VATA.438.
1 changes for  ¯KA4URAP. => ¯K4AURAP.
1 changes for  ¯VI7RAM. => ¯VIRAM.
1 changes for  ¯D2AC2AK. => ¯DAC2AK.
1 changes for  ¯KARAKA. => ¯K4ARAKA.
2 changes for  ¯KATJ.C2R. => ¯KA7TJ.C2R.
1 changes for  ¯BI1G4AG. => ¯BIG4AG.
3 changes for  ¯MAHA7VIRAK4. => ¯MAHA7VI7RAK4.
1 changes for  ¯KA7P.S. => ¯KAP.S.
6 changes for  ¯TA7ND2JA-BR. => ¯TA7N2D2JA-BR.
4 changes for  ¯C2A7N4KH.C2R. => ¯C2A7N5KH.C2R.
1 changes for  ¯MAN.C2R. => ¯MA7N.C2R.
1 changes for  ¯ALAM5KA4RAT. => ¯ALAM5KA7RAT.
1 changes for  ¯C2A7C2VATA25. => ¯C2A7C2VATA.25.
8 changes for  ¯KA7T2J. => ¯KA7TJ.
1 changes for  ¯MUN2D.UP. => ¯MUN2D2.UP.
4 changes for  ¯VP4. => ¯VP.
2 changes for  ¯VP2. => ¯VP.
1 changes for  ¯MA7LAV4. => ¯MA7LAV.
1 changes for  ¯KA7RA2ND2. => ¯KA7RAN2D2.
1 changes for  ¯C2I2C. => ¯C2IC2.
1 changes for  ¯RV. => ¯R2V.
2 changes for  ¯KA7RA7N2D2. => ¯KA7RAN2D2.
5 changes for  ¯R2. => ¯R.
8 changes for  ¯KA7RAKA. => ¯K4ARAKA.
1 changes for  ¯C2A7C2VATA61  => ¯C2A7C2VATA.61 
2 changes for  ¯UNA7DIS. => ¯UN2A7DIS.
1 changes for  ¯C2I1LA7N5KA1. => ¯C2I7LA7N5KA.1.
1 changes for  ¯ALAM5KA4RAC2. => ¯ALAM5KA7RAC2.
1 changes for  ¯BHAVA7PR. => ¯BHA7VAPR.
6 changes for  ¯KA7T2J.C2R. => ¯KA7TJ.C2R.
1 changes for  ¯PR.P4. => ¯PR.P.
2 changes for  ¯C2A7N4KH.GR2HJ. => ¯C2A7N5KH.GR2HJ.
1 changes for  ¯GA7UT. => ¯GAUT.
3 changes for  ¯K4AD. => ¯KA7D.
1 changes for  ¯RASENDRAK496. => ¯RASENDRAK4.96.
1 changes for  ¯AV.PRA7JA7C2K4. => ¯AV.PRA7JAC2K4.
1 changes for  ¯K5AMPAKA. => ¯K4AMPAKA.
1 changes for  ¯A7PAST4.C2R. => ¯A7PAST.C2R.
1 changes for  ¯VAGRAK4K4H. => ¯VAG4RAK4K4H.
1 changes for  ¯PRATIG4N5A7S. => ¯PRATIG4N4A7S.
2 changes for  ¯K4A7RAN2D2. => ¯KA7RAN2D2.
1 changes for  ¯K4ULIKOP. => ¯K4U7LIKOP.
2 changes for  ¯KAMPAKA. => ¯K4AMPAKA.
1 changes for  ¯NJA7JAM4. => ¯NJA7JAM.
1 changes for  ¯R6V. => ¯R2V.
4 changes for  ¯A7PAST.CR. => ¯A7PAST.C2R.
1 changes for  ¯NJA7JA7S. => ¯NJA7JAS.
1 changes for  ¯C2A7C2VATA388. => ¯C2A7C2VATA.388.
1 changes for  ¯C2A7C2VATA476. => ¯C2A7C2VATA.476.
1 changes for  ¯C2A7C2VATA479. => ¯C2A7C2VATA.479.
1 changes for  ¯A7C2V.GRHJ. => ¯A7C2V.GR2HJ.
1 changes for  ¯NI7LAK4. => ¯NI7LAK.
1 changes for  ‹(nach› ¯NI7LAK4.) =>  (‹nach› ¯NI7LAK4. )
100 changes for  ¯NILAK. => ¯NI7LAK.
1 changes for  ¯GAN2IT.SPASHT4. => ¯GAN2IT.SPASHT2.
17 changes for  ¯BIG4AG. => ¯BI7G4AG.
1 changes for  ¯GI10T. => ¯GI7T.
14 changes for  ¯NI3LAK. => ¯NI7LAK.
1 changes for  ¯C2ILA7N5KA. => ¯C2I7LA7N5KA.
1 changes for  ¯K4AVJAPR. => ¯KA7VJAPR.
4 changes for  ¯KA7P. => ¯KAP.
1 changes for  ¯P4AN4K4AR. => ¯PAN4K4AR.
1 changes for  ¯LALIT4. => ¯LALIT.
1 changes for  ¯A7PAST.C2R2. => ¯A7PAST.C2R.
8 changes for  ¯BHA7T2T2. => ¯BHAT2T2.
5 changes for  ¯C2I7LA7N4KA. => ¯C2I7LA7N5KA.
2 changes for  ¯KA4RAKA. => ¯K4ARAKA.
2 changes for  ¯A7PAST4. => ¯A7PAST.
1 changes for  ¯C2A7C2VATA280 => ¯C2A7C2VATA.280
1 changes for  ¯G4A8TAKAM. => ¯G4A7TAKAM.
1 changes for  ¯C2A7NKH.C2R. => ¯C2A7N5KH.C2R.
1 changes for  ¯LA7T26J. => ¯LA7T2J.
3 changes for  ¯DA7C2AK. => ¯DAC2AK.
1 changes for  ¯DA7C2AR. => ¯DAC2AR.
2 changes for  ¯A7PAST.C2R4. => ¯A7PAST.C2R.
3 changes for  ¯KA7UC2. => ¯KAUC2.
1 changes for  ¯C2A7C2VATA263. => ¯C2A7C2VATA.263.
1 changes for  ¯KA7T4J.C2R. => ¯KA7TJ.C2R.
1 changes for  ¯VRSHABH. => ¯VR2SHABH.
1 changes for  ¯SHADV.BR. => ¯SHAD2V.BR.
4 changes for  ¯HARSHAK44. => ¯HARSHAK4.
2 changes for  ¯C2I3LA7N5KA. => ¯C2I7LA7N5KA.
4 changes for  ¯KHA7ND.UP. => ¯K4HA7ND.UP.
3 changes for  ¯AV.PRA7JAC2K. => ¯AV.PRA7JAC2K4.
1 changes for  ¯C2A7C2VATA292. => ¯C2A7C2VATA.292.
2 changes for  ¯KATJ.CR. => ¯KA7TJ.C2R.
1 changes for  ¯C2A7NKH.BR. => ¯C2A7N5KH.BR.
1 changes for  ¯RASENDRAK456. => ¯RASENDRAK4.56.
2 changes for  ¯BHA7G.5 => ¯BHAG.5
1 changes for  ¯BHA7G.3 => ¯BHAG.3
1 changes for  ¯BHA7G.4 => ¯BHAG.4
1 changes for  ¯BHA7G. ¯ed.Bomb.3,19,14. =>  ¯BHAG. ¯ed.Bomb.3,19,14.
1 changes for  ¯BHA7G.¯P.4,11,10. => ¯BHA7G.P.4,11,10.
1 changes for  ¯MA7HIDH  => ¯MAHI7DH 
16 changes for  ¯TA7NDJA-BR. => ¯TA7N2D2JA-BR.
1 changes for  ¯GAN2IT.BHAGAN29 => ¯GAN2IT.BHAGAN2.9
1 changes for  ¯GAN2IT.SPASHT. => ¯GAN2IT.SPASHT2.
1 changes for  ¯A7PA7ST.C2R. => ¯A7PAST.C2R.
1 changes for  ¯K2A7RAN2D2. => ¯KA7RAN2D2.
1 changes for  ¯PRATIG4N4AS. => ¯PRATIG4N4A7S.
1 changes for  ¯ARUN2.UP. => ¯A7RUN2.UP.
1 changes for  ¯VI7SHN2US. => ¯VISHN2US.
4 changes for  ¯VAG4RAK4KH. => ¯VAG4RAK4K4H.
37 changes for  ¯HARSHAK. => ¯HARSHAK4.
14 changes for  ¯G4ATAKAM. => ¯G4A7TAKAM.
1 changes for  ¯KAMPAKA470. => ¯K4AMPAKA.470.
1 changes for  ¯A7PAST.CR2. => ¯A7PAST.C2R.
1 changes for  ¯C2A7C2VATA105. => ¯C2A7C2VATA.105.
1 changes for  ¯BI7G5AG. => ¯BIG4AG.
1 changes for  ¯GR2HJAS. => ¯GR2HJA7S.
1 changes for  ¯C2A7T.BR. => ¯C2AT.BR.
2 changes for  ¯G4AN2IT. => ¯GAN2IT.
1 changes for  ¯VAG5RAK4K4H. => ¯VAG4RAK4K4H.
1 changes for  ¯C2A5NKH.C2R. => ¯C2A7N5KH.C2R.
1 changes for  ¯C2A7C2VATA554. => ¯C2A7C2VATA.554.
1 changes for  ¯RA7SENDRAK4. => ¯RASENDRAK4.
1 changes for  ¯RA7TNAM. => ¯RATNAM.
8 changes for  ¯K4ARAN2D2. => ¯KA7RAN2D2.
1 changes for  ¯A7K. => ¯AK.
1 changes for  ¯KA7URAP. => ¯K4AURAP.
1 changes for  ¯TAITT.A4R. => ¯TAITT.A7R.
1 changes for  ¯C2A7C2VATA630. => ¯C2A7C2VATA.630.
1 changes for  ¯R4. => ¯R.
1 changes for  ¯C2A7C2VATA543. => ¯C2A7C2VATA.543.
1 changes for  ¯M2R2K4K4H. => ¯MR2K4K4H.
1 changes for  ¯CULBAS. => ¯C2ULBAS.
2 changes for  ¯ARSH.BR. => ¯A7RSH.BR.
8 changes for  ¯TAN2D2JA-BR. => ¯TA7N2D2JA-BR.
1 changes for  ¯CAT.BR. => ¯C2AT.BR.
1 changes for  ¯C2A7C2VATA806 => ¯C2A7C2VATA.806
1 changes for  ¯K4ARAND. => ¯KA7RAN2D2.
1 changes for  ¯NJA7JAS4. => ¯NJA7JAS.
1 changes for  ¯NJAJAS. => ¯NJA7JAS.
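
The summary above is regular enough to tally by script. The following is a small sketch, assuming each line has the form `N changes for OLD => NEW` as shown; the helper names `parse_summary` and `total_changes` are hypothetical and are not part of the PWK programs.

```python
import re

# One summary line: count, old abbreviation, new abbreviation.
LINE_RE = re.compile(r"^\s*(\d+) changes for\s+(.*?)\s*=>\s*(.*?)\s*$")


def parse_summary(lines):
    """Return (count, old, new) tuples for every line that matches the pattern."""
    rows = []
    for line in lines:
        m = LINE_RE.match(line)
        if m:
            rows.append((int(m.group(1)), m.group(2), m.group(3)))
    return rows


def total_changes(rows):
    """Sum the per-line change counts."""
    return sum(count for count, _old, _new in rows)


# Three lines copied from the summary, used here only as sample input.
sample = [
    "5 changes for  ¯SUPAR2N2. => ¯SUPARN2.",
    "100 changes for  ¯NILAK. => ¯NI7LAK.",
    "1 changes for  ¯C2A7C2VATA61  => ¯C2A7C2VATA.61 ",
]
rows = parse_summary(sample)
print(rows[0])              # (5, '¯SUPAR2N2.', '¯SUPARN2.')
print(total_changes(rows))  # 106
```

Run over the full summary, the same tally gives a quick cross-check on the overall number of changes.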

funderburkjim commented 8 years ago

PWK programs rerun.

No change to pwbib cases. Still 15 remain in bibminuscref.

Some progress in abbrvlist matching.

Previously (#26): 64092 out of 73111 cases (89.5%). Now: 66112 out of 73116 cases (90.4%).
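
A quick way to recompute these rates from the counts is sketched below; `match_rate` is a hypothetical helper, not one of the PWK programs. With the counts exactly as quoted, the current figure reproduces as 90.4%, while the earlier pair works out to roughly 87.7%, so one of the numbers carried over from #26 may have been mistyped.

```python
def match_rate(matched, total):
    """Percentage of abbreviation instances that match the abbreviation list."""
    return 100.0 * matched / total


previous = match_rate(64092, 73111)
current = match_rate(66112, 73116)

print(f"previous: {previous:.1f}%")   # previous: 87.7%
print(f"current:  {current:.1f}%")    # current:  90.4%
print("additional matched cases:", 66112 - 64092)  # 2020
```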

drdhaval2785 commented 8 years ago

@funderburkjim

> No change to pwbib cases. Still 15 remain in bibminuscref.

They have been examined, and corrections are suggested in https://github.com/sanskrit-lexicon/PWK/issues/37. I insist that you complete those corrections first. That way we would close one issue (bib minus cref) conclusively and exhaustively, and no members would remain in that list.

gasyoun commented 8 years ago

Lovely approach, huge results.

drdhaval2785 commented 8 years ago

I agree. It is a good leap forward.