Closed vlegoff closed 6 months ago
Thanks, I will have a look at the PR
@vlegoff I refined the PR and merged it to the main branch, let me know if anything goes super wrong! cutoff
arg is now t_max
.
I think it would be interesting to use all data (train and test) to estimate the censoring distribution used for weighting, in the same way as in the Graf score & with the same justification.
We never use both train and test data for G(t)
(not even in graf). But certainly we use all of training data first, before applying the t_max
cutoff. See these lines where the estimation happens and later we give the t_max
to the C function which filters observations pretty much
We never use both train and test data for G(t) (not even in graf)
Yes, I misread the doc for the Graf score, thanks for point it out!
Hello mlr3proba team,
In Uno's article about the C-Index, he mentions truncating the C-Index with a prespecified τ:
the following being the justification for this:
This is possible in the actual implementation of the C-index through the
cutoff
parameter, but when working with multiple datasets (e.g. in a benchmark), it would be interesting to use a censoring proportionp_max
, in the same way as with the Graf score.Reference Uno, H., Cai, T., Pencina, M. J., D'Agostino, R. B., & Wei, L. J. (2011). On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in medicine, 30(10), 1105–1117. https://doi.org/10.1002/sim.4154