Benjamin-Lee / deep-rules

Ten Quick Tips for Deep Learning in Biology
https://benjamin-lee.github.io/deep-rules/

Understand the trade-off between interpretability and performance #25

Open signalbash opened 5 years ago

signalbash commented 5 years ago

Deep learning models are notoriously difficult to interpret. If a model is trained on raw data without explicitly engineered features, how will you detect biases in its predictions?

What sort of problem are you trying to solve with DL? Is the high performance of a DL approach worth the difficulty of explaining how the model arrives at a prediction, or does the value of the model lie in what it reveals about the biological problem at hand?

evancofer commented 5 years ago

I think something can be learned about the problem by determining which models perform well, right? For instance, the fact that FactorNet benefits from additional inputs such as DNase-seq data says something about transcription factor (TF) binding. Obviously, there is a limit to how much can be learned this way, and one should always guard against over-interpretation. However, I would argue that high-performing models can help us generate insightful hypotheses (which we should then verify) about the biological phenomena being modeled.
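
To make the idea of learning from "which models perform well" concrete, here is a minimal sketch of an input ablation. It is not FactorNet and uses no real data: the data are synthetic, the feature names `motif` and `accessibility` are hypothetical stand-ins, and the model is a plain logistic regression written with the standard library only. It simply compares held-out accuracy with and without the second input.

```python
import math
import random


def make_data(n, rng):
    """Synthetic examples: the label depends on BOTH features plus noise."""
    rows, labels = [], []
    for _ in range(n):
        motif = rng.gauss(0, 1)          # hypothetical sequence-motif score
        accessibility = rng.gauss(0, 1)  # hypothetical accessibility signal
        noise = rng.gauss(0, 0.5)
        rows.append((motif, accessibility))
        labels.append(1 if motif + accessibility + noise > 0 else 0)
    return rows, labels


def linear_score(weights, bias, x):
    return bias + sum(w * xi for w, xi in zip(weights, x))


def fit_logistic(rows, labels, epochs=200, lr=0.5):
    """Full-batch gradient descent on the mean logistic loss."""
    n_features = len(rows[0])
    weights, bias = [0.0] * n_features, 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for x, y in zip(rows, labels):
            # Clamp the score so math.exp cannot overflow.
            z = max(-30.0, min(30.0, linear_score(weights, bias, x)))
            p = 1.0 / (1.0 + math.exp(-z))
            for j in range(n_features):
                grad_w[j] += (p - y) * x[j]
            grad_b += p - y
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def accuracy(weights, bias, rows, labels):
    hits = sum((linear_score(weights, bias, x) > 0) == (y == 1)
               for x, y in zip(rows, labels))
    return hits / len(labels)


rng = random.Random(0)  # fixed seed so the comparison is repeatable
train_rows, train_labels = make_data(600, rng)
test_rows, test_labels = make_data(400, rng)

# Model A sees both inputs; model B is ablated to the motif score alone.
w_full, b_full = fit_logistic(train_rows, train_labels)
acc_full = accuracy(w_full, b_full, test_rows, test_labels)

train_motif = [(r[0],) for r in train_rows]
test_motif = [(r[0],) for r in test_rows]
w_motif, b_motif = fit_logistic(train_motif, train_labels)
acc_motif = accuracy(w_motif, b_motif, test_motif, test_labels)

print(f"both inputs: {acc_full:.3f}  motif only: {acc_motif:.3f}")
```

A gap between the two accuracies suggests the ablated input carries information about the label, which is a hypothesis to verify rather than a mechanistic explanation. In this toy setup the gap exists by construction, so it illustrates the comparison and nothing more.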

tbrittoborges commented 5 years ago

@evancofer could you please link the FactorNet repository or article? Thanks!