For the genes flow, we map reads back to the centroids_99 pan-genome. And use all the centroids_99 that are single copy marker genes for normalization, not only the marker genes from representative genomes. Therefore, change the infrastructure of collate_repgenome_markers.py to compute the centroids99 to marker genes mapping for all species.
midas_run_genes:
handle overlapping ranges in gene features before binary search
aligned reads should be collected before filtering
collate_repgenome_markers.py
to compute the centroids99 to marker genes mapping for all species.midas_run_genes
: