avinashbarnwal / AFTXGBoostPaper

AFT XGBOOST
6 stars 2 forks source link

Summary of Calls #1

Closed avinashbarnwal closed 4 years ago

avinashbarnwal commented 5 years ago

09/11/2019

Below Picture is best Fold1 Fold2 Fold3 Fold4 Fold5 One Fold - Fold1 is considered as Test dataset and Rest from Fold 2 -5 are considered training the dataset and all hyperparameter optimization is done on that. Similarly, Fold 2 becomes new test dataset and rest of datasets become the training dataset and all hyperparameter optimization is done on that.

Based on the above observation we are going to have different hyperparameters for each fold.

tdhock commented 5 years ago

you used survreg with dist="lognormal" which means your outputs should be all positive.. is that true? The outputs.csv.gz files have upper/lower limits which are sometimes negative, so for these data you should use "normal" instead.

hcho3 commented 5 years ago

@tdhock Are these limits in log scale? My understanding is that survival times are positive (time-to-event).

tdhock commented 5 years ago

all the outputs.csv.gz files are limits on log scale

avinashbarnwal commented 5 years ago

This problem is nearly solved. I had to change the logic for finding the infinite values and had to remove 0 variance variables.

avinashbarnwal commented 5 years ago

09/27/2019

avinashbarnwal commented 4 years ago

10/3/2019

tdhock commented 4 years ago