Dadmatech / DadmaTools

DadmaTools is a Persian NLP tools developed by Dadmatech Co.
Apache License 2.0
184 stars 40 forks source link

How can I train Dadmatech/Nevise on a custom dataset? #72

Open MohammadAmin001 opened 3 months ago

MohammadAmin001 commented 3 months ago

Congratulations on the tool you have provided. I had a few questions about the Nevise model: Is this model trained only with FAspell dataset? If another dataset is used: What is the structure of the database you used to train this model?

Does the dataset include correct and misspelled labels for words, or have you used sentences and labeled each of them as correct or misspelled?

How many sentences (or whatever) did you use to train the model?

Is it possible to access the data you used?

What is the method of training the model? Is the model being trained continuously or is it trained once and can be used now?