Closed michoug closed 6 years ago
Hey Greg. The error message and subsequent termination is being caused by pplacer. I don't have any direct control over this.
Hi Donovan Thanks for your answer. I resolved the issue by seeing that there was only gaps in the alignment and then manually removing the files. I suppose that it's because none of the 120 genes was found. Would it be possible to check this before the pplacer step and remove the sequence from the rest of the analysis ? Greg
Hey Greg. I think it must be something more subtle than that. CheckM will default to using a "universal" set of markers if a genome contains less than 10 unique marker genes (see --unique flag).
Hey Donovan, I was not aware that GtdbTk was using checkM ! I didn't find the --unique flag in the GtdbTk script. Looking at the gtdbtk_bac120_markers_summary file, there are some case when no unique_genes or number_multiple_genes are found
Hey Greg. Sorry, too many different programs these days. The GTDB-Tk has the flag "--min_perc_aa". If you set this to something greater than zero does it filter these out for you?
Yes it does, here is one of the lines of the log
[2018-06-01 11:26:43] INFO: 18 user genomes have amino acids in <1.0% of columns in filtered MSA.
Great. I think we will actually change the default to 50% in the next release.
Hi Analysing different MAGs with potential plasmids, I get this error :
Would it be possible to skip the sequence without shutting down the software ? Cheers Greg