Open SilasK opened 10 months ago
I have a small issue, I run fastp with the command line below.
It detects high duplication rate (30%) also some of the reads seem very short as seen in the insert size hist.
The base histograms shows some weird fluctuation at the beginning, which I take to be primer dimers.
My question is if there are short inserts that overlap, the reads cannot be longer than the insert, and should be remove, isn't it?
fastp --thread 8 --qualified_quality_phred 8 --length_required 75 --low_complexity_filter --detect_adapter_for_pe --correction --cut_tail --dup_calc_accuracy 5 --cut_mean_quality 15 --dedup --in1 Sample_R1.fastq.gz --in2 Sample_R1.fastq.gz --out1 qc_sample_R1.fastq.gz --out2 qc_sample_R2.fastq.gz --json report.json --html report.html
I have a small issue, I run fastp with the command line below.
It detects high duplication rate (30%) also some of the reads seem very short as seen in the insert size hist.
The base histograms shows some weird fluctuation at the beginning, which I take to be primer dimers.
My question is if there are short inserts that overlap, the reads cannot be longer than the insert, and should be remove, isn't it?