Open vvasuki opened 4 years ago
https://github.com/sanskrit/raw_etexts/blob/master/koshaH/raghuvIra/english-hindi-technical-terms-split.pdf.txt has 2-column split OCR text, and it is pretty good. The manual labor required is mostly separating out lines and marking headwords. We must check whether regex magic can be employed to some extent.
Basically, the work would involve marking headwords and separating entries into separate lines.
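As a rough illustration of the kind of regex pass meant above, here is a minimal sketch. It assumes (this is not verified against the OCR file) that every entry opens with an English headword in Latin script and that any line starting with something else is a continuation of the previous entry. The function name `join_entries` is hypothetical.

```python
import re

# Assumption: a new entry always starts with a Latin letter (the English headword).
STARTS_NEW_ENTRY = re.compile(r"[A-Za-z]")

def join_entries(lines):
    """Merge continuation lines so that each entry ends up on one line."""
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # drop blank OCR lines
        if entries and not STARTS_NEW_ENTRY.match(line):
            # Line does not open with a Latin letter: treat it as spill-over.
            entries[-1] += " " + line
        else:
            entries.append(line)
    return entries

if __name__ == "__main__":
    sample = ["wavy border", "तरंगित तट", "", "children pl. of child"]
    for entry in join_entries(sample):
        print(entry)
```

Continuation lines that happen to start with an English word would be split wrongly by this heuristic, so the output would still need a manual pass.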
@drdhaval2785 has kindly taken up the task of generating a dictionary from this.
Bigger list of raghuvIra's dicts.
@drdhaval2785 I recall that raghuvIra uses brAhmI letters for abbreviations in some places. In such cases, it is best to replace them with devanAgarI.
Also, it would be good to mark up the equivalent Hindi terms, say with {}, to facilitate generating reverse lookup dicts. This can probably be done mechanically for the most part.
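A minimal sketch of how the {} markup could be applied mechanically, assuming that each Hindi equivalent is a run of Devanagari words joined by single spaces or hyphens. The function names are hypothetical, and the pattern covers the whole Devanagari block (U+0900 to U+097F), so a danda adjacent to a term would be wrapped too.

```python
import re

# One "term" = Devanagari words joined by single spaces or hyphens (an assumption).
DEVANAGARI_TERM = re.compile(r"[\u0900-\u097F]+(?:[ \-][\u0900-\u097F]+)*")

def mark_hindi_terms(entry):
    """Wrap every Devanagari run in {} so it can be picked out later."""
    return DEVANAGARI_TERM.sub(lambda m: "{" + m.group(0) + "}", entry)

def hindi_terms(marked_entry):
    """Return the terms enclosed in {} for building a reverse lookup."""
    return re.findall(r"\{([^{}]+)\}", marked_entry)

if __name__ == "__main__":
    marked = mark_hindi_terms("Scorpii = Shaula एकादश वृश्चिक m., मूल n.")
    print(marked)
    print(hindi_terms(marked))
```

Abbreviations written in Devanagari would also get wrapped by this pass, so those would have to be excluded or unmarked by hand.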
I suppose that this effort was abandoned by @drdhaval2785 . Samskrita bhAratI seems interested:
"Shankar from my team will call you after lunch.. we are digitizing Raghuvir’s dictionary of tech terms.. we want some guidance/ideas from you as to how to structure the txt file so that in future it can be used to build a stardic format file."
So says Mr. Shastri of Samskrita Bharati.
I hereby declare that I have not at all started anything worthwhile in this dictionary.
TSV files were received as requested at https://drive.google.com/drive/folders/1qJRs9kczw98-zRx4gGxPfejhIagWOxb0 (thanks to Jishnu Suresh and MVR shAstrI of saMskRta-bhArati's ebhAratI sampat team).
However, several errors persist.
For example, observe that the definition here has spilled over to the next lines, and the ERROR in the same screenshot:
Namaskara, we hope the file uploaded on 17th Nov meets expectations. Please let us know if any further changes are needed.
Regards,
Ebs team
Thank you for letting me know. (Until now, I had received no notice at all that you had revised the file!)
The following faulty rows remain; please locate them, fix them, and let me know:
['alloxanic adj. Chem. उपतिग्मिक', 'Chemistry']
['alloxantin Chem. उपतिग्मकि f.', 'Chemistry']
['Betula verrucosa (white birch) श्वेत भूर्ज']
['bulb of the urethra Zoology मूत्रमार्ग-कन्द']
[]
[]
['children pl. of child']
['Co = गु =']
['(differential coefficient) p-102']
['(partial differential coefficient)']
['(incremental ratio)']
[]
[]
[]
[]
[]
[]
[]
['phloro-glucinol Chem. शिफ-मध्विव', 'Chemistry']
['phlorol (phloretic acid+-ol) Chem. शिफव m.', 'Chemistry']
['possessions see possession']
["Phys. (Stefan's constant) कि (संपूर्ण-विकिरण स्थिरांक)"]
['Phys. (pole-strength per unit area) च (चुम्बकीय-ध्रुवशक्ति)']
['Phys. (surface density) त (तल-घनता)']
['Scorpii = Cor Scorpii = Antares प्रथम वृश्चिक m., ज्येष्ठा f.', 'Physics']
['Scorpii = Graffias = Acrab द्वितीय वृश्चिक m.', 'Physics']
['Scorpii = Dschubba चतुर्थ वृश्चिक m., अनुराधा f.', 'Physics']
['Scorpii = Shaula एकादश वृश्चिक m., मूल n.', 'Physics']
['Scorpii अष्टादश वृश्चिक m.', 'Physics']
['Scorpii ऊनविंश वृश्चिक m.', 'Physics']
['Scorpii = Lesath विंश वृश्चिक m.', 'Physics']
['spikenard ']
[]
['(McLean I.308)']
['V']
['wavy border', 'तरंगित तट']
['X']
['Y']
['Z']
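For anyone who wants to re-run this kind of check, here is a minimal sketch that lists rows whose count of non-empty cells differs from an expected value. The expected column count of 4 is an assumption made purely for illustration (the real column layout of the TSV is not stated here), and `find_bad_rows` is a hypothetical name.

```python
import csv
import io

EXPECTED_COLUMNS = 4  # assumption: set this to the real column count of the TSV

def find_bad_rows(tsv_text, expected_columns=EXPECTED_COLUMNS):
    """Return (line_number, non_empty_cells) for every malformed row."""
    # QUOTE_NONE keeps apostrophes and quotes in definitions as plain text.
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
    bad = []
    for line_number, row in enumerate(reader, start=1):
        cells = [cell for cell in row if cell.strip()]
        if len(cells) != expected_columns:
            bad.append((line_number, cells))
    return bad

if __name__ == "__main__":
    sample = "a\tb\tc\td\nalloxanic adj. Chem. उपतिग्मिक\tChemistry\n\nw\tx\ty\tz\n"
    for line_number, cells in find_bad_rows(sample):
        print(line_number, cells)
```

Blank lines show up as `[]`, and rows where the definition spilled into the headword cell show up with too few cells.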
We will fix them and let you know. Thank you.
Regards,
Ebs team
Namaste,
We have done our best to clear all the blank cells from the original file and have also made sure that the values are in the appropriate columns. After the rectification, the file was converted to .tsv and placed at the path below.
https://drive.google.com/drive/folders/1hF2gyPKGfn4QzYfSnxgvyglr_v4kykNb?usp=drive_link
We request you to verify it and let us know if any further changes are needed.
Regards, EBS team
Well done. The StarDict dictionary generated from the file provided by the Samskrita Bharati team is available at https://github.com/indic-dict/stardict-hindi/blob/gh-pages/en-head/tars/tars.MD
Example entries -
From raghuvIra_gov_ed (en-hi)
acceleration
a, f (acceleration) Physics Phys. त्व (त्वरण)
acceleration
f (acceleration) Physics Phys. त्व (त्वरण)
It can also be installed and used by following the instructions at https://sanskrit-coders.github.io/dictionaries/offline/Stardict/#how-to-install-and-use-dictionaries-on-your-device
Announcements: https://groups.google.com/g/samskrita/c/YBqwvWcoSBI
(reopening for other dicts listed in https://github.com/indic-dict/stardict-hindi/issues/3#issuecomment-657933386 )
Thank you.
Regards, EBS team