Open JoRussell-IDM opened 3 years ago
unique_indices_df
linelist table of all unique infections in my population sample
https://github.com/jribado-IDM/tskit_test/blob/workflow-refactor/utilities/ibs-from-sites.py
give it a np array with 2000 different strains, returns list that has the hash of each 2000, then has subsequent functionality for the hash lookup in my sample back to a matrix against hash vs hash.
For collapse of simulated genome to sequenced observation: https://github.com/jribado-IDM/tskit_test/blob/workflow-refactor/postparent/workflow/genepi_to_genomicformat.py
Need parallel structure for identity : root : density
could use other parallel metadata to pre-filter the unique_genomes that enter into this whole pipeline
Working on this with sql_to_df.py in https://github.com/JoRussell-IDM/emod-parasite-genetics/tree/master/garki_long_run_genetic_behavior/analyzers
Plots of InsetChart
Aim for comparable Garki scenario using Co-transmission model and FPG model with comparable analysis of genetic feature