xhochy closed 10 years ago
I think cross-validation is meant to be random. Our training data is clearly biased towards ham. Assigning samples to folds uniformly at random should give us folds whose ham vs. spam ratio is roughly the same as in the full training set.
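
To illustrate the point about random folds, here is a minimal sketch, not the project's actual code. The helper names `random_folds` and `ham_fraction`, the 800/200 ham/spam split, and the choice of 5 folds are all assumptions made up for the example. It shuffles the sample indices uniformly and checks the ham share in each fold:

```python
import random


def random_folds(labels, k, seed=0):
    """Shuffle sample indices uniformly at random and split them into k folds.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    indices = list(range(len(labels)))
    rng.shuffle(indices)
    # Fold i takes every k-th shuffled index, so fold sizes differ by at most one.
    return [indices[i::k] for i in range(k)]


def ham_fraction(labels, fold):
    """Return the share of samples in the fold that are labelled ham."""
    return sum(labels[i] == "ham" for i in fold) / len(fold)


# Made-up, imbalanced data set: 80% ham, 20% spam.
labels = ["ham"] * 800 + ["spam"] * 200

folds = random_folds(labels, k=5)
fractions = [ham_fraction(labels, fold) for fold in folds]
print(fractions)  # each value should be close to 0.8, but not exactly equal
```

Random assignment only matches the overall ratio approximately, and the match gets worse for small data sets or rare classes. If the folds need to have the same ratio by construction, a stratified split (dividing ham and spam into folds separately) would be the alternative.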