I've implemented souporcell for mixture of six-samples. and I have whole exome sequencing results for individual samples (i.e. six .fastq files).
Then, how do I identify the assocations between souporcell clusters and samples?
I think cluster_genotypes.vcf contains hint for this but, I don't understand GO:GN part (0/0, 0/1 or 1/0, 1/1 likelihood, but minial log likelihood was not selected as GT part) I want to know how to extract cluster specific variants.
If we have cluster representing SNVs from 1, we may utilzed this info. to identify samples with WXS results.
I've implemented souporcell for mixture of six-samples. and I have whole exome sequencing results for individual samples (i.e. six .fastq files). Then, how do I identify the assocations between souporcell clusters and samples?