snayfach / MIDAS

An integrated pipeline for estimating strain-level genomic variation from metagenomic data
http://dx.doi.org/10.1101/gr.201863.115
GNU General Public License v3.0

Inconsistent species results #107

Open mcleanlab opened 4 years ago

mcleanlab commented 4 years ago

Hello,

We have been using MIDAS for some time, specifically the Docker image quay.io/fhcrc-microbiome/midas. While setting up a new machine, I installed Docker and did a test run of MIDAS on some previously analyzed data. The new results did not match the old results: the pan-genomes were the same, but the number of hits was different.

At first I assumed this was caused by a database update. However, I found that I can run MIDAS species on the same dataset, with the same database, from the same Docker image, about 20 minutes apart, and still get different results. I had not noticed this behavior before, but I had not been looking for it either. I had assumed that the matching step did not involve randomness and that the results would be repeatable.

Is MIDAS working as intended, or has something gone wrong with my setup?

Thank you for your time,
Mcleanlab
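One way to pin down exactly which rows change between two runs is to diff the species output tables directly. The sketch below is a generic illustration, not part of MIDAS: the function names `load_table` and `diff_runs`, the `species_id` key column, the `count_reads` column, and the sample rows are all assumptions made for the example and should be adjusted to match the actual output files.

```python
import csv
import io


def load_table(text, key="species_id"):
    """Parse tab-separated text into {key value: row dict}.

    The key column name is an assumption; pass the real one if it differs.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return {row[key]: row for row in reader}


def diff_runs(text_a, text_b, key="species_id"):
    """Return {id: (row_a, row_b)} for every id whose rows differ.

    An id present in only one run is reported with None for the other run.
    """
    a = load_table(text_a, key)
    b = load_table(text_b, key)
    return {
        k: (a.get(k), b.get(k))
        for k in sorted(set(a) | set(b))
        if a.get(k) != b.get(k)
    }


if __name__ == "__main__":
    # Hypothetical sample data standing in for two runs on the same input.
    run1 = "species_id\tcount_reads\nsp_A\t10\nsp_B\t5\n"
    run2 = "species_id\tcount_reads\nsp_A\t10\nsp_B\t7\n"
    for species, (row1, row2) in diff_runs(run1, run2).items():
        print(species, row1, row2)
```

With real data, the two strings would be replaced by the contents of the output files from each run, for example read with `open(path).read()`. An empty result would mean the two tables are identical row for row.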