indic-dict / stardict-sanskrit

Stardict dictionary files for the Sanskrit language.
https://sanskrit-coders.github.io/dictionaries/offline/

Proofread: Mayrhofer's Kurzgefasstes etymologisches Wörterbuch des Altindischen #137

Open suhasm opened 2 years ago

suhasm commented 2 years ago

http://samskrtam.ru/sanskrit-lexicon/KEWA/

Can a stardict file be made that shows the screenshots for each headword?

vvasuki commented 2 years ago

Note from closed issue:

http://samskrtam.ru/sanskrit-lexicon/KEWA/ by marcis @gasyoun merits being OCR-ed and stardictified (headwords are already cleaned - so it should be a simple scripting task). BVP discussion https://groups.google.com/d/msgid/bvparishat/d4a94a28-55e4-4c88-af12-fc19fe9b9304n%40googlegroups.com .

vvasuki commented 2 years ago

It contains only 9587 entries until आयुः though. Wonder if there's a way to get the rest. @gasyoun - do you have the rest?

vvasuki commented 2 years ago

Done (सिद्धम्). OCR mostly succeeded, barring Greek and some accents, which require manual correction.

vvasuki commented 2 years ago

@Andhrabharati is working on proofreading this.

Andhrabharati commented 2 years ago

@vvasuki I found that your OCR needs more corrections for accents etc., so I redid all 3 volumes completely, including the front pages and the final 180 pages of Vol. 3 that are missing from @gasyoun's pdf set (and hence from your OCR). [There is a combined scan of the 4 volumes at archive.org]

Summary of entries (total: 12007):

- base entries: 9112
- new entries: 378
- entry revisions: 2517
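As a quick sanity check, the three counts above can be summed and compared with the stated total. This is only a minimal sketch; the variable names are illustrative and not taken from the project.

```python
# Entry counts as reported in the comment above (names are illustrative).
counts = {
    "base entries": 9112,
    "new entries": 378,
    "entry revisions": 2517,
}
reported_total = 12007

# Sum the categories and compare against the reported total.
computed_total = sum(counts.values())
print(computed_total == reported_total)  # prints True
```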

I have finished about 30% of the overall work (planned in phases), am taking a short break now, and will resume the project soon.