Closed Shalu411 closed 3 years ago
Namaste http://www.sanskrit-lexicon.uni-koeln.de/scans/PWGScan/2013/web/webtc2/index.php On same issue above- more light. I gave a querry regarding "nI" (keeping dhAtu in mind). But the output starts with - 1 अग्रणी,2 अग्रु,3 अग्रेणी 4 अङ्किन्5 अतस6 अतिथिग्व7 अतिशय8 अदर्शन9 अद्मसद्य10 अद्यतन11 अधिरुक्म12 अधोवर्चस्13 अन्14 अनीक15 अनीकवन्त्16 अनुनय17 अनुनय18 अनुष्ठान19 अन्तर्20 अन्तिक- (When Maximum output limit is set to 20). Then when will the turn of "nI" as a separate entry come?
This is better on http://www.sanskrit-lexicon.uni-koeln.de/scans/PWGScan/disp2/index.php- older search page - where I get the exact word in a second. So one can choose between these two displays.
[This can be put as an explanation in FAQ-question of yours- "Which is best dictionary for me?" :)]
Even in Advanced Search page, this issue can be set right by providing the searched word (here-"nI") independently first- as I had suggested earlier. And words contaning "nI" in the Head Word be given first preference. Here- These words could be happily sent to back or kept out of that small output list. 2 अग्रु,4 अङ्किन्5 अतस6 अतिथिग्व7 अतिशय8 अदर्शन9 अद्मसद्य10 अद्यतन11 अधिरुक्म12 अधोवर्चस्13 अन् 16 अनुनय17 अनुनय18 अनुष्ठान19 अन्तर्20 अन्तिक- Thankyou
aMSahara :: aMSa hara <tab> adj einen erbschaftsantheil empfangend erbend p sch jagn
Compare this to a display of the underlying data:
(1. aṁśa hara) adj. einen Erbschaftsantheil empfangend, erbend, P. 3, 2, 9, Sch. JĀǴŃ. 2, 132. 133.
You see that the summary has two parts (separated by the tab): a Sanskrit part and a German (non-Sanskrit) part; and the Sanskrit part itself has two parts, separated by the double colon, comprising the headword as first part and the Sanskrit words in the text as a second part (The Sanskrit parts are in SLP1 transliteration). You'll also see some 'simplifications' of the non-Sanskrit part. When the Advanced search program finds matching records, it goes through this Summary file, line-by-line, looking for matches. Now, you can see that implementing such enhancements as the Word Weight would require a quite different data structure. What real search Engines do in part is to create what I think are called 'inverted indexes' of the records. An inverted index would have something like a list of all Sanskrit words, one per line and an associated list of records where this Sanskrit word appears; ditto for German (or non-Sanskrit). The counts are accomplished in part, I think, by what is called 'faceting'. For example, on the bestbuy.com web site, if you search for Samsung Laptops, you see on the Left 'Computers and tablets (131), etc; this aspect of knowing there are 131 entries for this category is an example of faceting. My main point is that such issues are conceptually non-trivial; I currently do not know how to implement a more robust search engine for the Cologne sanskrit-lexicon dictionaries. The only part of your suggestions I can see as a current feasible enhancement, for PWG, is to add the 'Sanskrit Headword' line as a third search category. Do you think that would be useful to you?
@funderburkjim Point 2 and 3 are worth doing.
@Shalu411 Do you still have any issue in this regard? Or can we close this issue?
@drdhaval2785 let us start using Projects. See bellow:
Namaste In the Advanced display parameters- "Maximum" 20, 50, 100, 200 etc. is for output options. Right? (Please suggest that with "Maximum Words/results in output"- that's a different issue)
Here my point is the exact word searched for (that particular spelling-ed word) should be given first preference of appearance. Others should follow, only later. So if I am searching for a very popular word and it has 1000s of appearances, then I will have to keep on increasing my Maximum output - until I find it. This happened recently to me, when I was trying to explain how to use this page to my friend.
So this thing could be rectified by 1. giving exact word first and 2. Arranging the entries not by alphabet order, but by maximum number of occurence of the word in the text of that particular suggested word. Thankyou.