Stability of loss function for left censored data

Thanks for the great blog post; it gave me a detailed understanding of the why and what of WTTE-RNN. I've recently started working with left-censored data (for censored instances, the observed label is higher than the true label) and planned to implement a similar methodology. My initial enthusiasm was quickly dampened by one look at the loss function.

I still went ahead and implemented this loss function, but I am running into numerical instability when t/alpha or beta/alpha approaches zero (I clip these values at 0 so that the log can be computed).
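For reference, here is a minimal sketch of one way to keep a left-censored Weibull log-likelihood finite. It is not taken from the wtte-rnn code base. It assumes the standard continuous Weibull form, where an exactly observed time contributes log f(t) and a left-censored time contributes log F(t) = log(1 - exp(-(t/alpha)^beta)). The function name `weibull_loglik_left_censored` and the convention u = 1 for exactly observed, u = 0 for left-censored are assumptions made for illustration. The idea is to work with log((t/alpha)^beta) directly, use `expm1` instead of `1 - exp(...)`, and fall back to the small-argument approximation log F(t) ≈ beta * log(t/alpha) when the cumulative hazard is tiny.

```python
import numpy as np

def weibull_loglik_left_censored(t, u, alpha, beta):
    """Per-sample Weibull log-likelihood with left censoring (illustrative sketch).

    Assumed convention (hypothetical, not from wtte-rnn):
      u == 1 -> t observed exactly, contributes log f(t)
      u == 0 -> left-censored (true time <= t), contributes log F(t)
    """
    t = np.asarray(t, dtype=np.float64)
    u = np.asarray(u, dtype=np.float64)
    alpha = np.asarray(alpha, dtype=np.float64)
    beta = np.asarray(beta, dtype=np.float64)

    # Floor at the smallest positive normal float so the logs stay finite.
    tiny = np.finfo(np.float64).tiny
    log_t = np.log(np.maximum(t, tiny))
    log_alpha = np.log(np.maximum(alpha, tiny))
    log_beta = np.log(np.maximum(beta, tiny))

    # log of the cumulative hazard (t/alpha)^beta, computed in log space.
    log_h = beta * (log_t - log_alpha)
    # Cap the exponent so exp() cannot overflow to inf.
    h = np.exp(np.minimum(log_h, 700.0))

    # Exactly observed: log f(t) = log(beta) - log(t) + log_h - h.
    log_pdf = log_beta - log_t + log_h - h

    # Left-censored: log F(t) = log(1 - exp(-h)) = log(-expm1(-h)).
    # For very small h, 1 - exp(-h) ~= h, so log F(t) ~= log_h.
    small = log_h < -30.0
    h_safe = np.where(small, 1.0, h)  # dummy value keeps log() away from 0
    log_cdf = np.where(small, log_h, np.log(-np.expm1(-h_safe)))

    # Select per sample rather than multiply, to avoid 0 * inf.
    return np.where(u > 0.5, log_pdf, log_cdf)
```

The training loss would then be the negative mean of the returned values. In a deep learning framework the same structure should carry over using that framework's own `expm1`, `log`, and `where` operations, though gradients through the masked branch may need separate care.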

Just wondering whether you have thought about left-censored data and have any recommendations for applying this methodology to it.

ragulpr/wtte-rnn, issue #66