Closed jzyee closed 6 years ago
is it just the change in the binary parameter in the initialization of the EmailDataset?
EmailDataset() provides parameters to set binary and real value vectorizers. If binary flag is set to False, it is loading the presence of words stored in feature vector. The 2nd situation is by default when the binary flag is not set.
The robust naive bayes learner(Adversary Aware) has some scalability issues. We are fixing it.
I am trying to set up naive Bayes learners(bernoulli and multinomial), how do i set up the vectorizers for two separate situations?
I believe the load_dataset function is doing the 2nd situation currently by default, is this right?