Closed dennyabrain closed 7 months ago
From what I broadly understand, what we want to do is lemmatization and stemming. The root word is called a lemma.
Lemmatization for English
Lemmatization for Hindi
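To make the distinction between the two concrete, here is a toy sketch in Python. It is not a real stemmer or lemmatizer: the suffix list, the `LEMMAS` table, and the function names `naive_stem` and `naive_lemma` are all made up for illustration. Stemming chops suffixes by rule, while lemmatization looks the word up and returns its dictionary form (the lemma).

```python
# Toy illustration only: real systems use a proper stemmer/lemmatizer.
SUFFIXES = ("ing", "ed", "s")  # hypothetical, deliberately tiny rule set
LEMMAS = {"ran": "run", "better": "good", "mice": "mouse"}  # hypothetical lookup table


def naive_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least 3 characters."""
    token = word.lower()
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def naive_lemma(word: str) -> str:
    """Return the dictionary form if the word is in the table, else the word itself."""
    token = word.lower()
    return LEMMAS.get(token, token)


print(naive_stem("walking"))  # walk
print(naive_stem("ran"))      # ran (no rule applies)
print(naive_lemma("ran"))     # run (found in the lookup table)
```

The rule-based stem cannot recover irregular forms such as "ran", which is where a lemma lookup helps.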
4.2.9
Root
and Not Root
This issue is stale because it has been open for 30 days with no activity.
This issue was closed because it has been inactive for 14 days since being marked as stale.
We are using the phrase "root word" as a catch-all term. Many words are misspellings of a common slur. For instance, fuck could be spelled as fck, fk, fcuk, etc.
This was highlighted as a problem during our annotation sprints too. Annotators weren't sure whether they should annotate every misspelling, or whether it would be enough to annotate just the root word and have our system understand that those annotations are valid for all derived words.
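One way the "annotate the root word once" idea could look is sketched below. This is only an assumption about a possible shape, not the project's actual design: the `VARIANT_TO_ROOT` table, the `ROOT_ANNOTATIONS` structure, the `is_slur` field, and both function names are hypothetical. Variants are resolved to their root word first, and annotations are then read from the root.

```python
from typing import Optional

# Hypothetical table: each known misspelling points to its root word.
VARIANT_TO_ROOT = {
    "fck": "fuck",
    "fk": "fuck",
    "fcuk": "fuck",
}

# Hypothetical annotations, recorded once per root word by annotators.
ROOT_ANNOTATIONS = {
    "fuck": {"is_slur": True},
}


def root_of(word: str) -> str:
    """Map a word to its root; unknown words are treated as their own root."""
    token = word.strip().lower()
    return VARIANT_TO_ROOT.get(token, token)


def annotations_for(word: str) -> Optional[dict]:
    """Return the root word's annotations, or None if the root is not annotated."""
    return ROOT_ANNOTATIONS.get(root_of(word))


print(root_of("FCUK"))        # fuck
print(annotations_for("fk"))  # {'is_slur': True}
```

With this shape, annotators would only touch `ROOT_ANNOTATIONS`, and adding a new misspelling would be a one-line change to the variant table.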
In scope for this issue is,