sanskrit-lexicon / PWK

Sanskrit-Wörterbuch in kürzerer Fassung, 7 Bände Petersburg 1879-1889
3 stars 1 forks source link

commentary study #39

Open drdhaval2785 opened 8 years ago

drdhaval2785 commented 8 years ago

https://github.com/sanskrit-lexicon/PWK/tree/master/pw_ls/pw_dhaval/abbrvwork/commentary

This subrepository tries to analyse the commentary references in pw.txt

As of now

A. grep ' ‹zu› ' ../../../../../Cologne_localcopy/pw/orig/pw.txt > commentaryrefs.txt

This command fetches all the ' ‹zu› ' occurrences from pw.txt and stores to commentaryrefs.txt

B. Run commentarystudy.py 

This takes commentaryrefs.txt as input and gives the following output. 
B1. commleft.txt - references where the pattern matches '•Comm. ‹zu› ¯([A-Za-z0-9.,]*)'

https://github.com/sanskrit-lexicon/PWK/blob/master/pw_ls/pw_dhaval/abbrvwork/commentary/commleft.txt - this is the file which stored the references which follow •Comm. ‹zu› ¯. Please note - there are many references in this list which start with a numeral. Needs examination. Commentary on a number seems odd, if not wrong.

gasyoun commented 8 years ago

I'll not dive into this ocean for now, seems to deep. There should be a list of "service" words, not only zu, but an, fgg.. But as it is even above general abbreviation research, I must understand that alone I can't manage all of the already open issues. I would first finish the list of 12 lacking references in PW.

funderburkjim commented 8 years ago

@drdhaval2785 Skipping this issue for now.

drdhaval2785 commented 8 years ago

Yes. I am trying to refine the python code to get some more easily workable and manageable items. Please leave it as it is now.