Chap3: How do i select a threshold from the precision and recall vs threshold curve

Hi @FritzPeleke , Thanks for your question. The optimal precision/recall depends on your task. For example, if you're building an intruder detection system, you will want to catch as many intrusions as possible (high recall). In this case, you can lower the threshold a lot, which will increase recall (less false negatives, where an intruder gets in undetected) and decrease precision (more false positives, where the alarm goes off even though there's no intrusion). However, if you decrease the threshold too much, you will start to get many false positives. You will have to decide how many false positives per day you can tolerate. If a false positive just means that a security guard will need to look at a screen for a few seconds, then perhaps you can tolerate several false positives a day. But if it means waking up someone and getting them to travel 20km, then not so much. This is just an example, you can easily imagine other tasks where precision is more important than recall (e.g., selecting videos that are safe for children to watch). Hope this helps.

ageron / handson-ml

Chap3: How do i select a threshold from the precision and recall vs threshold curve #473