NRCan / Q2Pipe

Qiime2-based pipeline designed to facilitate and standardise ecological studies by metabarcoding using Illumina MiSeq data
MIT License

Figaro Division by Zero crash #8

Open Patg13 opened 1 year ago

Patg13 commented 1 year ago

Some datasets cause a division-by-zero error (e.g. Copeaux ITS). This must be investigated to identify the problem.

The dataset seems to have been pretrimmed, which makes the sequence lengths shorter than normal and is probably what triggers the error. I will investigate a little more, but it's probably not a bug in Figaro itself.

EDIT: The dataset was problematic because its longest sequence is far longer than the mean length, so when the offset is applied, the resulting trimming length is too high, which rejects almost all reads in the dataset. I will not close this yet, because it points out a weakness in how the Figaro trimming parameter is defined (longest length - user-defined offset). A better way to determine the offset must be found.
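To make the failure mode concrete, here is a minimal Python sketch of the rule described above (longest length minus a user-defined offset) applied to a made-up set of read lengths containing one very long outlier. The function names, the example lengths, and the percentile-based variant are all assumptions for illustration only. None of this is Figaro or Q2Pipe code, and the percentile idea is just one possible direction, not a decided fix.

```python
import math


def trim_length_from_longest(lengths, offset):
    """Rule described in this issue: longest read length minus the offset."""
    return max(lengths) - offset


def trim_length_from_percentile(lengths, percentile, offset):
    """Hypothetical alternative: start from a percentile of the read
    lengths (nearest-rank method) instead of the single longest read."""
    ordered = sorted(lengths)
    rank = max(1, math.ceil(percentile / 100 * len(ordered)))
    return ordered[rank - 1] - offset


def retained_fraction(lengths, trim_length):
    """Fraction of reads that are at least trim_length long."""
    kept = sum(1 for n in lengths if n >= trim_length)
    return kept / len(lengths)


# Made-up example: 99 pretrimmed reads of 150 bp plus one 300 bp outlier.
lengths = [150] * 99 + [300]
offset = 20

longest_based = trim_length_from_longest(lengths, offset)
median_based = trim_length_from_percentile(lengths, 50, offset)

print(longest_based, retained_fraction(lengths, longest_based))
print(median_based, retained_fraction(lengths, median_based))
```

With these made-up numbers, the longest-based rule asks for 280 bp and keeps only the outlier, while the percentile-based variant asks for 130 bp and keeps every read. Whether a percentile, the mean, or something else is the right basis would still need to be checked on real datasets.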

Maybe wait for Figaro2, which should include a mechanism to fix this issue.