Open agitter opened 7 years ago
The most fundamental flaw here is interpreting conv. filters as PWMs. No single filter is ever going to actually capture a complete representation of the binding motif it might be partially capturing. DNNs learn distributed representations. So there are and will be multiple partially redundant filters that collectively model a binding site. More so, for deeper networks the conv. filters in the first layer often don't even remotely resemble known PWMs/motifs. The higher layers are often learning these. This may be actually more useful for initializing CNNs with known PWMs by transforming the PWMs to a more appropriately scaled but equivalent (in terms of performance) conv. filter.
https://doi.org/10.1101/163220
Opening this for discussion. I didn't read it, but the amount of related work they're missing is an immediate red flag. And this line in the abstract isn't correct: