Closed ilwookkim closed 5 months ago
I would first ask at Zymo for their primer sequence. Because it can be of great advantage to filter reads with primer sequences. Alternatively, there are some options:
--skip_cutadapt
-c cutadapt.config
that contains (see -u 20 -U 20
for forward and reverse 20 nuc primer) [not tested, you might need to solve the variables!]
process {
withName: CUTADAPT_BASIC {
ext.args = { [
"--minimum-length 1 -u 20 -U 20",
"-O ${params.cutadapt_min_overlap}",
"-e ${params.cutadapt_max_error_rate}",
params.pacbio ? "--rc -g ${meta.fw_primer}...${meta.rv_primer_revcomp}" :
params.iontorrent ? "--rc -g ${meta.fw_primer}...${meta.rv_primer_revcomp}" :
params.single_end ? "-g ${meta.fw_primer}" :
"-g ${meta.fw_primer} -G ${meta.rv_primer}",
params.retain_untrimmed ? '' : "--discard-untrimmed"
].join(' ').trim() }
}
--skip_cutadapt
and use a config (similar to above) to make filtntrim trim reads, see here, by using an appropriate configEdit: Essentially, I am not for adding another feature to solve this, because it is not good practice to not know the primer sequence.
Thanks for the answer. I would like to try second option because Zymo refused to provide their primer seq :( Thanks again. Best,
As detailed in #744 my suggestion doesnt work. Please do the following:
-c cutadapt.config
as above--FW_primer GGGGGGGGGG --RV_primer GGGGGGGGGG --retain_untrimmed --cutadapt_min_overlap 10
--qiime_ref_taxonomy
or --cut_dada_ref_taxonomy
because they will require correct primersThis will give fake primer sequences to cutadapt so that it wont complain. It will set a primer match to at least 10*G (which will never match) and allow all reads that did not contain a primer (i.e. all) to pass. But it will remove by -u
& -U
the unknown primer sequences.
Let me know how that goes.
Thanks a lot! It works without any issue.
Great! Then I'll close that issue, feel free to open another one if you come across another problem. You could also join nf-core slack via https://nf-co.re/join to get access to a more chat like function for questions like that.
Description of feature
Hi all,
Recently I start to analysis 16s rRNA seq. Our lab used Zymo Quick-16S Primer Set V3-V4 for the amplicon. However the zymo doesn't provide exact seqeunce of primer, instead provide only length of primer.
When I provide primer seq as 'N{16}', cutadapt refuses to use very long wildcard string. Therefore I would like to trim the primer seq by length using fastp.
Please let me know whether it is possible or not. Thanks a lot.