Closed avinashbarnwal closed 4 years ago
you used survreg with dist="lognormal" which means your outputs should be all positive.. is that true? The outputs.csv.gz files have upper/lower limits which are sometimes negative, so for these data you should use "normal" instead.
@tdhock Are these limits in log scale? My understanding is that survival times are positive (time-to-event).
all the outputs.csv.gz files are limits on log scale
This problem is nearly solved. I had to change the logic for finding the infinite values and had to remove 0 variance variables.
09/27/2019
10/3/2019
09/11/2019
Below Picture is best Fold1 Fold2 Fold3 Fold4 Fold5 One Fold - Fold1 is considered as Test dataset and Rest from Fold 2 -5 are considered training the dataset and all hyperparameter optimization is done on that. Similarly, Fold 2 becomes new test dataset and rest of datasets become the training dataset and all hyperparameter optimization is done on that.
Based on the above observation we are going to have different hyperparameters for each fold.