Microbial-Ecology-Group / AMRplusplus

AMR++ is a bioinformatic pipeline meant to aid in the analysis of raw sequencing reads to characterize the profile of antimicrobial resistance genes, or resistome.
https://www.meglab.org/
GNU General Public License v3.0
28 stars 12 forks source link

Samples are failing at runsnp step of AMR++ #10

Closed LLansing closed 1 year ago

LLansing commented 1 year ago

I am running AMR++ pipeline on 177 paired samples. The entirety of the run is successful except that 18 samples fail the runsnp step of the pipeline with the following error in the nextflow output:

[c8/b4aa4e] NOTE: Process STANDARD_AMRplusplus:FASTQ_RESISTOME_WF:runsnp (SRR13495884) terminated with an error exit status (1) -- Error is ignored

and in the .command.err files, they all end with a similar cut error:

... ++ awk -v RS=, '/SRR13495950/{print NR; exit}' AMR_analytic_matrix.csv

This results in those 18 samples missing from the SNPconfirmed_AMR_analytic_matrix.csv file.

A previous run of the pipeline with the same data had 25 samples with this error, and only 3 erroneous samples in common between the two runs. This makes it seem like the error is somewhat random.

(Cross-posted from AMRplusplus_SNP)

Isabella136 commented 1 year ago

May be connected to #9 since as of now both issues always occur together

EnriqueDoster commented 1 year ago

@LLansing, sorry for the delay but we just pushed updates to AMR++ (v3.0.1) and the AmrPlusPlus_SNP software which should solve the issues you were running into. Please try the latest version of AMR++ and let us know if you run into any further issues.