Open Chiyun-Lee opened 5 months ago
Notes for developers:
Backend: change code which currently excludes all het haplotypes (two+ strings separated by a comma) to identify which are phaseable/phased and keep them in df_clonal.
Option 1 - include major haplotype only (string to left of comma)
Option 2 - include major and minor as separate haplotypes. This will require duplicating the sample row and renaming sample IDs to such as “Sample A (maj)” and “Sample A (min)”
Frontend: implement option to include 1) just clonal haplotypes, 2) clonal + phaseable hets
Currently, heterozygous samples (containing more than one strain) are removed from analysis. Some samples can potentially be split into two new samples if they are "phaseable" into the two corresponding phases. Doing this would increase the number of samples available for overall analysis.