Sum02dean / MLG

Machine Learning in Genomics Course ETH
MIT License
3 stars 3 forks source link

Determine window size #40

Open Sum02dean opened 2 years ago

Sum02dean commented 2 years ago

For each sequence, count the average number of nucleotides from start position to TSS (start).

The window size should not contain too many Ns (padding values). Identity the average distance from position_1 - TSS(end).

The window should be symmetrically sized around the +++TSS-start and TSS-end+++

Expecto used 20kbp either side.