rhpvorderman / sequali

Fast sequencing data quality metrics
GNU Affero General Public License v3.0
9 stars 0 forks source link

Add very common repeats in the human genome. #153

Open rhpvorderman opened 2 months ago

rhpvorderman commented 2 months ago

I also found this sequence to be present a lot:

CCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCAC
revcomp:
GTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGG

It maps to human ImmunoGlobolin heavy chain, but is also a very common repeat structure in the human genome. I need to do some more searching.