Closed by thak123 8 years ago
Hi,
What we have in the Indic NLP library is a word segmenter and not a true morph analyzer, i.e. the library can break a word into its component units. So you will not directly get a stem, but may have to do some post-processing. I can suggest a procedure that may work.
For example, the Marathi word घरासमोरचा may be broken as घरा समोर चा.
Now, you can have the following ways of obtaining the stem:
e.g. महेश्वराचा may be segmented as महे श्वरा चा. Taking only the first word would be wrong in this case.
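To make the pitfall concrete, here is a minimal sketch of two post-processing heuristics applied to the segmentations quoted above. Neither function is part of the Indic NLP library, and neither is necessarily the procedure meant in this reply: `first_segment_stem`, `strip_suffix_stem` and the small `SUFFIX_SEGMENTS` set are hypothetical names and data chosen purely for illustration.

```python
# Hypothetical post-processing heuristics; NOT part of the Indic NLP library.

# Illustrative, incomplete set of Marathi suffix/postposition segments (an assumption).
SUFFIX_SEGMENTS = {"चा", "ची", "चे", "समोर"}


def first_segment_stem(segments):
    """Naive heuristic: treat the first segment as the stem."""
    return segments[0] if segments else ""


def strip_suffix_stem(segments, suffixes=SUFFIX_SEGMENTS):
    """Drop trailing segments found in `suffixes` and join whatever remains."""
    end = len(segments)
    # Always keep at least one segment so a non-empty word never yields "".
    while end > 1 and segments[end - 1] in suffixes:
        end -= 1
    return "".join(segments[:end])


if __name__ == "__main__":
    for segs in (["घरा", "समोर", "चा"], ["महे", "श्वरा", "चा"]):
        print(segs, first_segment_stem(segs), strip_suffix_stem(segs))
```

On घरा समोर चा both heuristics return घरा (the form the segmenter emits, which may still need normalising). On महे श्वरा चा the first-segment heuristic returns only महे, while stripping known suffixes keeps महेश्वरा. The suffix list would need to be curated for real use.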
As for using the segmenter, this documentation should help:
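For orientation, a rough sketch of calling the segmenter from Python follows. It assumes the `indicnlp` package is installed and that the separate Indic NLP resources have been downloaded locally; the helper names `segment_words` and `build_marathi_analyzer` and the resources path are hypothetical, and the library calls reflect my understanding of its documented usage rather than anything confirmed in this thread.

```python
def segment_words(words, analyzer):
    """Return one list of segments per input word.

    `analyzer` is expected to offer morph_analyze(word) -> list of segments,
    which is how UnsupervisedMorphAnalyzer is understood to behave.
    """
    return [analyzer.morph_analyze(word) for word in words]


def build_marathi_analyzer(resources_path):
    """Create the library's unsupervised segmenter for Marathi ('mr')."""
    # Imported lazily so segment_words can be used without the library installed.
    from indicnlp import common, loader
    from indicnlp.morph import unsupervised_morph

    # resources_path: a local copy of the Indic NLP resources (assumed location).
    common.set_resources_path(resources_path)
    loader.load()
    return unsupervised_morph.UnsupervisedMorphAnalyzer("mr")


# Example (requires the library and its resources; the path is a placeholder):
# analyzer = build_marathi_analyzer("/path/to/indic_nlp_resources")
# print(segment_words(["घरासमोरचा"], analyzer))
```

The actual segments depend on the trained model shipped with the resources, so the output may differ from the hand-written examples in this thread.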
Hope this helps.
~Anoop
Thanks for the prompt reply. I'll try it and submit the results.
Can you please tell me how I can use the existing morphological analyser to get the stems of the words provided as input to the Indic NLP library?