biocore / metagenomics_pooling_notebook

Jupyter notebooks to assist with sample processing
MIT License
8 stars 16 forks source link

Fail validation when sample-sheet contains duplicate entries in [Bioinformatics] #80

Open charles-cowart opened 2 years ago

charles-cowart commented 2 years ago

Sequence_processing_pipeline will treat duplicate entries in [Bioinformatics] as separate projects, and attempt to perform adapter trimming and/or human filtering on the same project twice. This will cause QCJob to fail when it attempts to move zero-length files for the second-time.

@antgonza and I discussed this and agreed that it would be best for this to be a new requirement for sample-sheet validation, rather than have the pipeline silently handle this use-case.