Closed vvasuki closed 8 years ago
Are there any pdfs of this book avialable?? As source file is not giving clear picture, because of lack of clarity in some tags. if pdf is there it will some how improve final output. I searched in archives, scribd, and so many.. but cannot found it.
http://www.sanskrit-lexicon.uni-koeln.de/scans/MCIScan/2014/web/index.php इति दृश्यताम्
2016-04-18 8:56 GMT-07:00 श्रीराम notifications@github.com:
Are there any pdfs of this book avialable?? As source file is not giving clear picture, because of lack of clarity in some tags. if pdf is there it will some how improve final output. I searched in archives, scribd, and so many.. but cannot found it.
— You are receiving this because you authored the thread. Reply to this email directly or view it on GitHub https://github.com/sanskrit-coders/stardict-sanskrit/issues/5#issuecomment-211443800
Vishvas /विश्वासः
धन्यावाद: /\
Sorry for late. my laptop got repaired. and yesterday i got it back to work. now mahabharata_cultural_index completed. Update....
Dear dAmodara - wonderful progress!
I don't see any value whatsoever in including digits such as : 04) अङ्गद. In fact it is a negative - it brings up all such bad suggestions if we ever in the future make a dictionary whose indices genuinely need to start with a number (eg: sutra 04.2.23). I urge you to remove them.
Also, while this is superb work, it is laborious and hard to reproduce - do you intend to produce a script?
Also, can you replace page of the form p169 keywords with mahAbhArata cultural index p169? Users would not like p169 of MCI to be shown with p169 of some other book they're not interested in..
Oh. So i will replace p169 with 'p169.mci' like that. then it will solve second problem. Regarding first one, So search results may became problem, when other dicts really nead them. reason why i included them is when we search a page number 'p182' like that, the results are not in order of word order in original book. in original book, it fallows iast alphabetical order, and golden dict fallows order of language of first head word. So to fix it i added word number before duplicate of original word. thus it became browsable. So it is not intended for search purpose, rather for display. So to solve present side effect in search results, i will append an untypable unicode special char, before that number, so word order will be same, display will be good, no inconvinience in search results. like '★ 04) अङ्गद ' . and '★' or what ever char makes display also nice, and, word order will be there.
And regarding scripts, i am now learning higher human readable languages like u suggested python . So with in one month, i will try to learn and, produce a more general and interactive script, which can be used on any of these text files. please give this chance, it will be definately worth this wait,,, i promice. : )
Great news on the script! Eagerly await it.
"So it is not intended for search purpose, rather for display. So to solve present side effect in search results," - thanks for clarifying! Plz include that note in the README. Although I think that you are solving an unimportant problem with it - do users really care that the sort order for a given page is off? I fear that such distraction will keep you from getting to more important problems.
fixed all problems : )
closing issue, as main improvements are done, and it is well readable, and browsable.
We must produce a script to transform mci.txt into a babylon dictionary.
Regarding scripting - scripts in https://github.com/sanskrit-coders/stardict-sanskrit/tree/master/sa-head/kalpadruma-sa/mUlam may help you with a speedy start!