rodrigopivi / Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
https://rodrigopivi.github.io/Chatito/
MIT License
876 stars 157 forks source link

taking very long time #27

Closed Egalite123 closed 6 years ago

Egalite123 commented 6 years ago

Hi, After installing 2.0, for a decently large dataset it is taking much longer than the earlier version (for the same file). Same problem on the online too. Page times out. Anyone have similar experiences?

rodrigopivi commented 6 years ago

hi @Egalite123,

can you post your example? and how many examples you need?

rodrigopivi commented 6 years ago

Just released v2.1.0, now the online ide by default prevents generating datasets that don't specify the training and testing examples required... maybe it was taking too much time for you because the dataset was too big to be handled in a simple way. for very very large datasets the npm package with default adapter should work since it uses file streams. let me know if the new version or the npm package fixes your large dataset issue. closing the ticket for now, feel free to reopen if still having this issue.

suenpun commented 5 years ago

40,000 examples it seems like its stuck.one intention better less than 1000 examles under my testting

suenpun commented 5 years ago

and another question :how to generate the regex template in the format of rasa? Is there this function?