Closed Yangxin666 closed 4 years ago
yep
This is as intended. Think about what you might be able to learn from the training data when you have a severity score but no additional information. Also, think about how you would make a prediction if all of the measured predictors were missing.
This is as intended. Think about what you might be able to learn from the training data when you have a severity score but no additional information. Also, think about how you would make a prediction if all of the measured predictors were missing.
Well, I think its ok for some missing data in trainset. But, for test, this means you need to predict without knowing any knowledge. We need to predict id 167 & 113 without any information about it, i.e. we can only guess. If so, why not add other ids, like 168, 169, 170 etc, we also know no information for those ids.
I found subject id 134, 215, 219 are missing in the training set. These three ids appear in "severity_score_train.txt".
Similarly, subject id 167 and 113 are missing in test set but appear in "prediction.csv". Did anyone find the same issue?