single-cell-genetics / vireo

Demultiplexing pooled scRNA-seq data with or without genotype reference
https://vireoSNP.readthedocs.io
Apache License 2.0
71 stars 27 forks source link

Handling Multiple Sequencing Lanes and Donor Information in VireoSNP #101

Closed sxf-ux closed 3 months ago

sxf-ux commented 4 months ago

Hello,

I am currently working with 10X single-cell RNA-seq data from multiple sequencing lanes and fragments. Specifically, I have a sample (Sample X) which has been sequenced in two lanes, resulting in BAM files for Sample X-1 and Sample X-2. Additionally, I have a sample (Sample Y) were Sample Y-1 and Sample Y-2 may come from different brain areas of the same individual.

I have processed these BAM files (they've been treated separately) using cellSNP-lite and MQuad where I have generated AD and DP matrices for each lane/fragment and relevant vcf files. Now, I am aiming to run VireoSNP for demultiplexing and clonal assignment.

My questions are as follows:

I understand that I need to specify donors before identifying clones so I need to understand what approach works best for my dataset.

Thank you in advance.

huangyh09 commented 4 months ago

Thanks for your questions.

1) for different lanes, I assume that the same cell barcodes in two lanes refer to the cell cell, so I would suggest you run cellranger by inputting them together, so you will have a single bam file. 2) for two biological samples of a donor, one way to merge AD and DP is to keep the same list of variants (before or after cellsnp-lite) and then concatenate the cells. Another way is you use cellranger to merge two bams into a new bigger bam with cell barcodes with suffixes -1 and -2 for the two samples.

Yuanhua