When trying to make the pairwise comparison for many isolates,
eg to make the stats to produce when we have to look at isolates or not
THEN the nextflow script for the wrangling of the results to the database gets too long/big because it needs to simlink all the alternative files
Solution ?
[ ] consider splitting this process in to chuncks - the problems is that it needs to be written sequentially in the same db - cannot have several db
[ ] consider adding also an option where groups of samples should be compared (not all that are here - but this could wait after. because I need the distribution right now of similarities and differences -> open new issue then after some thoughts
When trying to make the pairwise comparison for many isolates, eg to make the stats to produce when we have to look at isolates or not THEN the nextflow script for the wrangling of the results to the database gets too long/big because it needs to simlink all the alternative files
Solution ?