B-UMMI / INNUca

INNUENDO quality control of reads, de novo assembly and contigs quality assessment, and possible contamination search
GNU General Public License v3.0
13 stars 7 forks source link

v3.1 #9

Closed miguelpmachado closed 7 years ago

miguelpmachado commented 7 years ago

Change assembly filtering step: try filtering contigs based on SPAdes K-mer coverage, contigs length and GC content, and if it fails filter only based on contigs length and GC content (to try to rescue the sample from failing) Change PASS, FAIL, WARNING rules (FAIL overrides WARNING):

Make SPAdes QC assessment only depending of SPAdes run information (not relying on AssemblyMapping anymore). Remove Trimmomatic and Pear information from QC assessment. Update E. coli maximum number of multiple alleles in TrueCoverage module. Include ReMatCh (https://github.com/B-UMMI/ReMatCh) as dependency for TrueCoverage module running. Make INNUca compatible with new MLST version. Add option –fastQCproceed to force INNUca to continue even if a sample fails FastQC. Add option --maxNumberContigs to set the maximum number of contigs per 1.5 Mb of expected genome size (useful for species that intrinsically produce a more fragmented genome assembly). Change --spadesUse_3_9 to –spadesVersion to specify a SPAdes version (default: 3.11.0). Add option --noLog to tell INNUca to not create a log file (useful in Slurm environment since stdout and stderr are usually saved in a file). Add option --noGitInfo to tell INNUca to not retreive GitHub repository information (useful when running INNUca in parallel independent jobs for many samples since it might save some time). Change final general report (PASS, FAIL, WARNING samples). Write failing and warning reports for each sample. Change dependencies checking. Fix minor errors. Update README. Add Dockerfile with Docker image recipe creation.