Open NudnikShpilkis opened 3 years ago
You can follow lifelines recommendations, except for 3. Is there high-collinearity in the dataset? Try using the variance inflation factor (VIF) to find redundant variables.
as it shouldn't make a difference, since we are only using hazard predicted on xgboost as feature for fitting the lifelines WeibullAFT model.
I've tried a few of the fixes listed above. Interestingly, when I use 2.
, scaling the duration vector down, I get vastly accelerated survival curves. Is there a second change I have to run to un-scale my predictions from a survival curve? Or are the predictions of the curve scale-invariant?
Should the time-bins used by XGBSE also be scaled?
When running a
XGBSEStackedWeibull
, I get alifelines.exceptions.ConvergenceError
with the following message:Given the pipeline nature of
XGBSEStackedWeibull
. Are there recommended steps to getting past the convergence error? I.E. Will the lifelines recommendations still hold, or are there other methods I should try?