jmschrei / tfmodisco-lite

A lite implementation of tfmodisco, a motif discovery algorithm for genomics experiments.
MIT License
56 stars 16 forks source link

Motifs missing when input sequence less than 400 bp #11

Closed ziwei-75 closed 1 year ago

ziwei-75 commented 1 year ago

If the user doesn't set the -w option, Modisco-lite may produce different results compared to standard Modisco when the input sequence is less than 400 bp. It would be good for the program to automatically account for this scenario, either by raising an error or by using the provided sequence length.

jmschrei commented 1 year ago

v2.0.7 will raise an error if the window is longer than the sequence -- or, more specifically, if the extracted subsequences are not of the same size as the window.