Netflix-Skunkworks / riskquant

Apache License 2.0
611 stars 61 forks source link

Low loss and high loss in real life #13

Open veeral-patel opened 4 years ago

veeral-patel commented 4 years ago

Let's take an example of Alice stealing our company's customer data, which is worth $1M to us.

Intuitively, the low loss is that she fails to take any data ($0) and the high loss is that she steals all the data ($1M).

Am I choosing the low loss and high loss correctly here? Also, I read that the low loss and high loss should create a 90% confidence interval...how does that apply here?

mdeshon commented 4 years ago

A $0 loss isn't really a loss, and in any case the lognormal can't capture a 0 low loss accurately (it can't deal well with large high loss/low loss ratios).

Instead when I am eliciting low/high loss values I describe them like this:

veeral-patel commented 4 years ago

@mdeshon thanks for your reply!

detection systems worked, we noticed the alarm as soon as is feasible, we stopped further loss, and recovered quickly

The way I'm thinking about it is that the loss is correlated with the amount of customer data lost. In the low loss scenario, a "small amount" of customer data is lost, but that feels arbitrary to me.

Should I say 1% of data was stolen for low loss? 5%? 20%?

High loss is easier for me. I just compute the cost of losing the data itself, plus secondary costs like regulatory fines, litigation, loss of employee productivity, hiring forensics firms, etc. It's a reasonable worst case scenario...am I thinking about this right?

veeral-patel commented 4 years ago

@mdeshon wanted to follow up!