Closed noahaus closed 7 months ago
So the scripts run, but I am unable to generate the figures because the test case is too small to have accessory genes present - necessitates re-analyzing the UK data
Results of the Accessory PCA
The separation between cattle and badger are still present - but the percent variation is way lower than before. probably because the effect of erroneous genes are mitigated + btb slow evolution.
Results of the gene prab:
I feel like something here is not fully registering, I think I need to revisit the way a virulent gene is found, because I feel like there has to be more to show
I did make some substantial changes to the code that makes this, but berceuse panaroo is more stringent, it appears that the virulent genes are the only ones that are found between 15 and 99% of isolates. might require a major rewrite!
Decided to just include the gene prab hierarchical cluster as its own output
It's nice but it would be nice to see which genes are most responsible for the clustering pattern
help messages added for each script
I wonder which Liliana will prefer