sanskrit-lexicon / PWK

Sanskrit-Wörterbuch in kürzerer Fassung, 7 Bände Petersburg 1879-1889
3 stars 1 forks source link

Commentary references #30

Open drdhaval2785 opened 8 years ago

drdhaval2785 commented 8 years ago

Based on a comment at https://github.com/sanskrit-lexicon/PWK/issues/29#issuecomment-165959969, I have used the following command

$ grep '‹zu› ¯[^.]*' pw.txt > commentaryrefs.txt

If we come to a conclusion that we need to consider commentaries on some book as a separate literary resource, we need to examine https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/commentaryrefs.txt

Total 3073

funderburkjim commented 8 years ago

@drdhaval2785 commentaryrefs.txt is a good resource!

drdhaval2785 commented 8 years ago

Commentaryrefs.txt has been analysed by commentarystudy.py. The output are four files. bothsides.txt - references on both sides of zu. leftside.txt - reference on left side of zu, but not on right rightside.txt - reference on right side of zu, but not on left commleft.txt - 'Comm.' on the left of zu and reference on the right.

gasyoun commented 8 years ago

@drdhaval2785 well done.

As per https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/bothsides.txt

VIG4AJAGAN2I.zu.DAMAJANTI7K.DERSELBE -> `DERSELBE` ordinary German word
K4AN2D2IDA7SA.zu.KA7VJAPR.nach -> `.nach` ordinary German word
Da7rila.zu.KAUC2.U.S.W -> `U.S.W.` German words means `etc.`
UTPALA.zu.Vara7h.br2h.11,2.fgg -> `.11,2.fgg` - no need, fgg = following pages

When we have UTPALA.zu.VARA7H.BR2H There is no need for

UTPALA.zu.VARA7H.BR2H.27…(25)…
UTPALA.zu.VARA7H.BR2H.DALAJOGA
UTPALA.zu.VARA7H.BR2H.S
UTPALA.zu.VARA7H.BR2H.s

https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/leftside.txt

Fuzzy corrections will not hurt

UTAPALA
UTPALA
UTPLA

https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/rightside.txt

Empty

https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/commleft.txt

H.an -> an German word meaning "at"

@zaaf any idea if it's good to kill the Roman numerals?

TS.I
TS.III

Fishy FOBH

There can be no UTPALA.zu. or even UTPALA.zu (similar as Va7rtt.zu., GOVINDA7N.zu.C2ak5k.zu)

<ls>Utpala.zu</ls> <ls>VARA7H.BR2H.15,1.</ls>
<ls>Utpala.zu</ls><ls>VARA7H.BR2H.7,1,12.26</ls>
<ls>Utpala.zu</ls><ls>VARA7H.BR2H.20,10,11.</ls>

1st case has a space between tags. If it - so all others should as well. Or kill all spaces. Should there really be 2 LS tags, Jim? If so instead of <ls>Utpala.zu</ls> <ls>VARA7H.BR2H.15,1.</ls> it should be <ls>Utpala</ls> zu <ls>VARA7H.BR2H</ls><other tag>15,1.</other tag> - this way it has some practical use and could be hyperlinked one far day. But not when all is glued together.

None of the numbers needed (because it's not a MSS number in a collection)

C2AM5K.zu.BA7DAR
C2AM5K.zu.BA7DAR.1,1,1…(S
C2AM5K.zu.BA7DAR.2,1,10.…
C2AM5K.zu.BA7DAR.2,2,11.(einmal
C2AM5K.zu.BA7DAR.2,2,17(S.543Z
C2AM5K.zu.BA7DAR.2,2,28…(S.569,…Z
C2AM5K.zu.BA7DAR.2,3,6(S
C2AM5K.zu.BA7DAR.3,1,8…(S
C2AM5K.zu.BA7DAR.3,2,5.fgg
C2AM5K.zu.BA7DAR.3,3,41…(S
C2AM5K.zu.BA7DAR.3,4,17.fgg
C2AM5K.zu.BA7DAR.4,3,14…(S