sanskrit-lexicon / alternateheadwords

Prepare list of alternate headwords for all Cologne dictionaries
1 stars 0 forks source link

English-Sanskrit Dictionaries Alternate Word Extraction #24

Open gasyoun opened 4 years ago

gasyoun commented 4 years ago

In Apte Student's English-Sanskrit Dictionary we have Cave, Cavern, s. guhā, darī, gahvaraṃ, kaṃdaraḥ- -rā, vivaraḥ-raṃ. As of guhā, darī, gahvaraṃ it's simple. All we would want to is to be able to split off in gahvaraṃ for search purposes.

We have kaṃdaraḥ--rā and vivaraḥ-raṃ, that should equal, but are not now: 1) kaṃdaraḥ 2) kaṃdarā 3) vivaraḥ 4) vivaraṃ

In Borooah: CAVE, CAVERN: I Natural: (1) guhā, entered the great c. praviveśa mahāguhām, V. p.; (2) gahvaram; (3) kandaraḥ (rī, raṃ); (4) darī or dariḥ. So kandaraḥ (rī, raṃ): 1) kandaraḥ 2) kandarī 3) kandaraṃ

And in Monier: 3) CAVE or CAVERN, s. vilaṃ, vivaraṃ, kandaraḥ -rā, garttaḥ, gahvaraṃ -rī, randhraṃ, guhā, guhāgṛhaṃ, nirddariḥ m., darī f., śailarandhraṃ. — (Excavation) khātaṃ, devakhātaṃ, devakhātavilaṃ. there is kandaraḥ -rā: 1) kandaraḥ 2) kandarā

Any idea what can be done, to improve the English-Sanskrit Dictionaries section?

funderburkjim commented 4 years ago

Are you talking about 'Advanced Search: Sanskrit text search' ? If so, a 'prefix search' has effect of dropping off the ending anusvara or visarga:

image

Is

funderburkjim commented 4 years ago

Is this issue related to https://github.com/sanskrit-lexicon/csl-corrections/issues/13 ?

gasyoun commented 4 years ago

Are you talking about 'Advanced Search: Sanskrit text search' ?

No, basic search.

Is this issue related to sanskrit-lexicon/csl-corrections#13 ?

I would not say so. But the arose looking at the same text.

funderburkjim commented 4 years ago

No. basic search,

Please elaborate. I'm not sure what you're thinking of. Some new kind of display?

gasyoun commented 4 years ago

Some new kind of display?

Yes, your global kind of.

I'm not sure what you're thinking of.

I will never find kaṃdarā in a list of synonyms, as it is spelled typographically kaṃdaraḥ--rā now.

funderburkjim commented 4 years ago

I don't know how to proceed with this at the moment. It seems that the advanced search example I provided above is related to your question, but not quite the same as what you have in mind.

gasyoun commented 4 years ago

advanced search example I provided above is related to your question, but not quite the same as what you have in mind.

It's not actually a single issue. 1) If you search for kaṃdara in MW there is no such word, but there is kandara in 6 results: kaṃdhara kandara kaṇḍarā kāṇḍāra kandarā khaṇḍara. So first question - can this be added to extra headwords ṃ/n?

I wanted to find all the places a cave is mentioned in Mahabharata. We remember that http://sanskrit-linguistics.org/dcs/ is based on MW. I opened all the English-Sanskrit Dictionaries and gathered all the possible words to be searched for cave in all dictionaries.

2) kandaraḥ is mentioned in dictionary, but for googling purposes I would want to be able to extract the list of synonyms without Nominal ending as well.

3) kaṃdaraḥ--rā is a subcase of kandaraḥ, because it's not a single entry, but two. Would love to see it encoded as:

Hope I'm clearer now. Thanks for listening.