With #152, the initial exploration of genetic demultiplexing (#127) could be considered mostly done, so I am creating a new issue for next steps. All of the components for genetic demultiplexing are there, but they are in separate workflows, with incomplete connections among them.
The task now is to join those all together so a single workflow can take as input the run data table and produce as output a set of files with all of the information needed for demultiplexing. Whether that would be a single SCE object with sample assignment as a part of colData, or separate SCE objects for each sample is not yet decided.
The steps are these:
for a given multiplexed library, identify the bulk samples that correspond to the multiplexed sample set
map relevant bulk samples with STAR
jointly call SNPs with mpileup from the bulk mapping to create a VCF
map single cell/nucleus samples with STARsolo
use bulk VCF and mapped single cell results to call single cell SNPs & demultiplex
(possibly) combine demultiplex info with previous quantification (including cellhash data)
With #152, the initial exploration of genetic demultiplexing (#127) could be considered mostly done, so I am creating a new issue for next steps. All of the components for genetic demultiplexing are there, but they are in separate workflows, with incomplete connections among them.
The task now is to join those all together so a single workflow can take as input the run data table and produce as output a set of files with all of the information needed for demultiplexing. Whether that would be a single SCE object with sample assignment as a part of colData, or separate SCE objects for each sample is not yet decided.
The steps are these: