Open arcturusannamalai opened 4 years ago
Hi @arcturusannamalai can you please elaborate this issue . do we need to add this dataset into our library ?
@VpkPrasanna - yes you can use these datasets and form a valid word list for the spelling checker; currently the word lists are https://github.com/Ezhil-Language-Foundation/open-tamil/blob/main/solthiruthi/data/tamilvu_dictionary_words.txt etc.
@VpkPrasanna - yes you can use these datasets and form a valid word list for the spelling checker; currently the word lists are https://github.com/Ezhil-Language-Foundation/open-tamil/blob/main/solthiruthi/data/tamilvu_dictionary_words.txt etc.
SO i have to add the new datasets into the same file right ?
Use open datasets from 1) https://www.kaggle.com/disisbig/tamil-wikipedia-articles 2) https://www.kaggle.com/disisbig/tamil-news-dataset