EnsemblGSOC / Ensembl-Repeat-Identification

A deep learning repository for predicting the location and type of repeat sequences in genomes.
generate repeats statistics #10

Open williamstark01 opened 2 years ago

williamstark01 commented 2 years ago

Generating repeat statistics would help us understand the data better and could be useful for a publication. Statistics on repeat lengths (distribution and more) are a first target, and statistics on repeat coordinates might also prove useful. Any other ideas for statistics we could explore?

@EreboPSilva may take a look at this.
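
As a starting point for the length statistics, here is a minimal sketch using only the Python standard library. It assumes each repeat annotation is available as a `(start, end, repeat_type)` tuple with 1-based inclusive coordinates; that layout, the function names `repeat_length_stats` and `length_histogram`, and the `bin_size` parameter are illustrative assumptions, not the repository's actual data format or API.

```python
import statistics
from collections import Counter


def repeat_length_stats(repeats):
    """Summarise repeat lengths.

    `repeats` is an iterable of (start, end, repeat_type) tuples.
    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    lengths = [end - start + 1 for start, end, _ in repeats]
    if not lengths:
        raise ValueError("no repeats given")
    return {
        "count": len(lengths),
        "min": min(lengths),
        "max": max(lengths),
        "mean": statistics.mean(lengths),
        "median": statistics.median(lengths),
    }


def length_histogram(repeats, bin_size=100):
    """Count repeats per length bin; each key is the lower edge of a bin."""
    counts = Counter(
        ((end - start + 1) // bin_size) * bin_size for start, end, _ in repeats
    )
    return dict(sorted(counts.items()))


# Hypothetical annotations, for illustration only.
example_repeats = [(1, 100, "LTR"), (201, 250, "LINE"), (1000, 1299, "LTR")]
stats = repeat_length_stats(example_repeats)
histogram = length_histogram(example_repeats)
```

The same per-repeat lengths could be grouped by `repeat_type` to get one distribution per repeat family, which may be the more useful view for a publication.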

williamstark01 commented 2 years ago

Some statistics generated in #29

williamstark01 commented 2 years ago

More stats in #30