I am having issues with TRF (version 4.09) missing multiple short homopolymers.
I've produced an example below with a (modified) sequence from the yeast genome that contains two motifs with period size 1 and a copy number of 6. Both motifs are detected in the first sequence, but single-nucleotide changes outside the repeat result in trf not reporting either the first or the second sequence, depending on the change.
The command I am calling is
trf test.fa 2 7 7 80 10 8 1000 -h -d -ngs
The contents of test.fa (with the inserted nucleotides capitalized) are:
I am having issues with TRF (version 4.09) missing multiple short homopolymers.
I've produced an example below with a (modified) sequence from the yeast genome that contains two motifs with period size 1 and a copy number of 6. Both motifs are detected in the first sequence, but single-nucleotide changes outside the repeat result in trf not reporting either the first or the second sequence, depending on the change.
The command I am calling is
trf test.fa 2 7 7 80 10 8 1000 -h -d -ngs
The contents of test.fa (with the inserted nucleotides capitalized) are:
Is this behavior expected? Is there any way to modify it?
Thank you!
- Eugene