Open ShannonDaddy opened 1 year ago
This feature is available as --low_complexity_filter
— see: https://github.com/OpenGene/fastp#low-complexity-filter
Yes, but this option filters out the whole read. What I need is to trim only the low complexity part of the read.
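To make the difference between filtering and masking concrete, here is a minimal Python sketch. It is not fastp code and not an existing fastp option. The `complexity` function follows the definition given in the fastp README (the fraction of bases that differ from the next base); `mask_low_complexity`, its sliding window, and its `window`, `threshold`, and `mask_char` parameters are hypothetical and only illustrate what per-region masking could look like.

```python
def complexity(seq: str) -> float:
    """Fraction of bases that differ from the following base.

    Mirrors the definition in the fastp README (base[i] != base[i+1]).
    """
    if len(seq) < 2:
        return 0.0
    diff = sum(1 for a, b in zip(seq, seq[1:]) if a != b)
    return diff / (len(seq) - 1)


def mask_low_complexity(seq: str, window: int = 20,
                        threshold: float = 0.3, mask_char: str = "N") -> str:
    """Hypothetical helper: mask every window whose complexity is below threshold.

    Assumption: a fixed-size sliding window with step 1; fastp itself
    scores the whole read and drops it, it does not do this.
    """
    masked = list(seq)
    last_start = max(len(seq) - window + 1, 1)
    for start in range(last_start):
        chunk = seq[start:start + window]
        if complexity(chunk) < threshold:
            for i in range(start, start + len(chunk)):
                masked[i] = mask_char
    return "".join(masked)


if __name__ == "__main__":
    read = "ACGTACGTAC" + "G" * 30 + "TGCATGCATG"
    print(mask_low_complexity(read))
```

One caveat with this measure: a dinucleotide repeat such as `ACACAC` scores as fully complex, because every base differs from the next one. A homopolymer run would be masked by the sketch, but a CA-style repeat would not, so a different measure (for example one based on k-mer diversity) would probably be needed for that case.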
@ShannonDaddy Hi, did you end up using the low complexity filter option for your samples? I am using nanopore to sequence viruses that tend to have repeat regions. Would you recommend using the low complexity option?
Hi, is there any option in fastp to trim or mask low complexity regions in nanopore reads? I regularly see low complexity reads such as the following:
@147b5220-2abe-4d1f-9507-b7d278e33efa ATGTGCTTCAGTTCAGTTACGTGTGCTGGTGCTGTCACTACTCAACAGGTGGCATGAATTAACTTACTTGCCTGTCGCTCTATCTTCGGCGTCTTGGGTGTTTAACCTACACTACACACACACCACACCACACACACACACACATACACACACCCCACAGCACACGCCCCCCACACACACAGACACCACACACACACCGCACACCACACTACACACACACCACACACCACACCACACACTACACACACACCACACACCACACCACACACACACCATACACACCACACCACACACCACACCACACACACCACACACACACAAACACACACACACACGCACACCGCCACCTGCACACACTACACACACACCACACACCACACCACACACCCACACACACACCACACACACCACACACACCACACCATACACACACACCACACACACACCACACACTACACACACCACACACACACCACACACACACCACACACAGCACACACCACATCCCACACACACACACCACACACCACACCACACACTACACACACACCACACATCGCACACCACACCACACACACCACACACCACACACACACTGCACACCACACACACCGCACACCACACACTACACACACACCACACACCACACACACCACACACACACCATACACCACACACGCACCACACACACACACCACACACCACACGCACACCACACACACCACACACACACCCATACCACACCACACACACCACACACACCACACACCACACACAGGTTAAACACCCAAACGGACATACCGCAATATCAGCACCAACAGAAGGTTAATTCATGCCACCCATATTTGGTCTTTACGTTGTTATGTGCTTCGTTCAGTTACGTATTGCTGGTGCTGCAGAGCTTTGACTAAGGAGCATGTTAACCTTTCTGTTGGTGCTGATATTGCGGCGTCTGCTTGGGTGTTTAACCTCATGAAAACGCAAATATTTTAAAAATGTAGCTTTATGCAAAAGCAAGCTGAAAGGTTTCTTGTTGCATTGTTGTACGTTGAAGCTCAGTCACTTTGCTGACATTGAGTTTCTTTTTCTCCCAGTCACCCTTCTCCACCAATGCTACTATTTATGCGAAGTGTCGGAAATTAACTTCTCATGTGACCACCCAATTCGGTTCCAGTCGCTTGGAAATGTAATCTATTC
@d4fc11d4-ef02-42b2-a908-2adc747e10b5 AGCGCCACTTGAGAGCCTGGACGATAAGAGTGAGACTCCATCTCAACAAAAATAAAAATAAATATATAACTTAGGTTATATTTTTGCTCATTAAAAAAATTCTACATAGACCTACTCCAGATGAAACCGGAGATAATATATATATTATACAAAATATTTCATACTATATCAAAAGATACTTGGCAGAAAAATTACACTGTCTTAAGAATAACAAATAAATAATTCCAGATGTCTATTCACAGATCATGGGTGGACTTATATGTAAGTACTAAACTACGTGTATAAACGTATTCATTCTCACAAAGAAAGACACATGCTGTTAATGCATATTGTTAAGTGAAAAAATAAGGTTTCAAAAAAGGATAGAAAGTCTTATCACATTTTTATTGTGAGTATATAATTGTGAAAAAGATTTAATCATACCAAAACAGAGTTGAACTAAGTGGAATGTATTAGTCTGTTCTCACACTGCTAATGAAGACATACCCAAGACTGGGTAATTTATAAAAGAAAAGAGGTTTAATGGACTCACAGTTCCACATGGCTGGGGAGGCCTCATAATCATGATGGAAGGTGAAGGGGAGTAAAGGCACATATTACATGGTGGCAGGCAAGAGAGCTTGTGCAGGAGTTAAACACCCAAGCAGACGCCGCAATATACAACCAACAGAAAGTTAATTCATACCACCTGTTAAATGACAGCACCAACTTGTGTACATGTACACACACACACACACACACACACACACACACACACACACACACACACACACACACACGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGATTCGTTCTTTTTTTTTGATTTTGTTCCCACTTTTTTTTTTTTGGTTTCCTTTTTACGTTGGTTTTGCCTTTTTTTTTTTTTTTGGTTGATTTGGTTACGTTCTCTTTGTGTTCCCTTTTGATTTTGTTTTTGCAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTTTTTTTGTCTTTTTTTTTTGGATTCTTTTTTTGGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGTTTTTTTTTTTTTTTGTTGATTTTGGATTT
I think fastp is by far the most powerful tool for processing sequencing reads. It would be great if fastp had a low complexity trimming or masking option. Thanks a lot!