jtamames / SqueezeMeta

A complete pipeline for metagenomic analysis
GNU General Public License v3.0
366 stars 79 forks

Why didn't this tool remove the genomic information of the host, such as human or mouse? #552

Closed luozhy88 closed 1 year ago

luozhy88 commented 1 year ago

1. Why didn't this tool remove the genomic information of the host, such as human or mouse?
2. Why didn't this tool remove common contaminants?

fpusan commented 1 year ago

1) Because you didn't ask it to. See the manual of v1.6 (option --filter in sqm_mapper.pl) to see how it's done.
2) You can preprocess your reads with sqm_mapper.pl to remove host reads and whatever else you think would be a "common contaminant". We don't do this by default, since the definition of "common contaminant" is context- and researcher-dependent.

fpusan commented 1 year ago

You need to define what you want to consider "common contaminants" in the scope of your study, and prepare the files for those yourself. We cannot help with that.

luozhy88 commented 1 year ago

How can I use sqm_annot.pl? Can you give an example?

jtamames commented 1 year ago

Please read the manual. All your questions are answered there.

luozhy88 commented 1 year ago

I need to remove host reads from 2 samples with sqm_mapper.pl, but the output always contains filtered fq files for only one sample. Why?


jtamames commented 1 year ago

Could you please include the syslog file?

jtamames commented 1 year ago

Anyway, I think I got it. Change line 267, which reads:

    if($filter) { filter($thissample,$outsam,$outdir,$pairs{$thissample}{pair1},$pairs{$thissample}{pair2}); exit; }

to:

    if($filter) { filter($thissample,$outsam,$outdir,$pairs{$thissample}{pair1},$pairs{$thissample}{pair2}); next; }

And run the script again.
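For readers wondering why swapping `exit` for `next` fixes the one-sample output, here is a minimal sketch in Python rather than Perl. It assumes the patched line sits inside a loop over samples, which the use of `next` and `$thissample` suggests but the thread does not show. The function names and the `.filtered` / `.mapped` labels are hypothetical stand-ins, not part of sqm_mapper.pl. Perl's `next` corresponds to Python's `continue`. Perl's `exit` ends the whole program, which the sketch approximates with `break` so that the function can still return its result.

```python
def filter_sample(sample):
    """Hypothetical stand-in for the script's filter() call."""
    return f"{sample}.filtered"


def run_with_exit(samples, do_filter=True):
    """Mimics the original line: the run stops after the first sample."""
    done = []
    for sample in samples:
        if do_filter:
            done.append(filter_sample(sample))
            break  # approximates Perl's `exit`: no later sample is processed
        done.append(f"{sample}.mapped")
    return done


def run_with_next(samples, do_filter=True):
    """Mimics the patched line: skip the rest of this iteration only."""
    done = []
    for sample in samples:
        if do_filter:
            done.append(filter_sample(sample))
            continue  # Perl's `next`: move on to the following sample
        done.append(f"{sample}.mapped")
    return done


if __name__ == "__main__":
    samples = ["sampleA", "sampleB"]
    print(run_with_exit(samples))
    print(run_with_next(samples))
```

With two samples, the first variant returns a single filtered entry and the second returns one per sample, which matches the behaviour reported above before and after the change.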

luozhy88 commented 1 year ago

Thanks, it works!