milaboratory / mixcr

MiXCR is an ultimate software platform for analysis of Next-Generation Sequencing (NGS) data for immune profiling.
https://mixcr.com

Same barcode assigned to two different cellGroups in clones.tsv file from 10X TCR data #1807

Closed genome2020 closed 1 month ago

genome2020 commented 1 month ago

Hi,

I recently ran MiXCR v.4.7.0 with my 10X TCR data on multiple samples. Here is the exact command I used.

mixcr -Xmx100g analyze 10x-sc-xcr-vdj --threads 8 --species hsa sample1_R1_001.fastq.gz sample1_R2_001.fastq.gz sample1

For some samples, I noticed the same cell barcode with different 'cellGroup' values in the clones.tsv file. See example:

[Screenshot: Screen Shot 2024-09-30 at 3 20 31 PM]

I would expect the cellGroup to be the same for a given barcode, since all TCR chains with the same barcode/cellGroup should belong to the same cell. Could you please explain why there are two cellGroups for the same barcode?

mizraelson commented 1 month ago

It’s hard to say without a closer investigation, but generally speaking, this can happen when multiple cells are captured in the same droplet. If MiXCR finds enough information in the dataset, it can identify that two different cells are marked with the same barcode. For example, imagine you have 100 cells with clones AB and 500 cells with clones CD. If you then find a barcode that contains both the AB and CD clones, MiXCR will assign them to the corresponding cell groups, so the same barcode can appear under two cellGroups.
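To make the AB/CD example concrete, here is a toy Python sketch of the general idea. It is only an illustration under simplifying assumptions, not MiXCR's actual algorithm: the function names, the `min_support` threshold, and the greedy matching are all hypothetical. It counts how often each chain combination occurs across barcodes, then splits a barcode whose chains can be explained by several well-supported combinations.

```python
from collections import Counter


def build_reference(barcode_to_chains):
    """Count how many barcodes carry each exact set of chains."""
    return Counter(frozenset(chains) for chains in barcode_to_chains.values())


def split_into_cell_groups(chains, reference, min_support=10):
    """Greedily cover `chains` with chain sets that are well supported
    by other barcodes. Each matched set becomes its own cell group.
    `min_support` is a hypothetical threshold chosen for this toy example.
    """
    remaining = set(chains)
    groups = []
    # Most frequent chain combinations are tried first.
    for ref_set, count in reference.most_common():
        if count < min_support:
            break
        if ref_set and ref_set <= remaining:
            groups.append(ref_set)
            remaining -= ref_set
    if remaining:
        # Chains not explained by any well-supported combination.
        groups.append(frozenset(remaining))
    return groups


# Toy data from the example: 100 cells with AB, 500 cells with CD,
# and one droplet (barcode) that captured both cells.
data = {f"ab{i}": {"A", "B"} for i in range(100)}
data.update({f"cd{i}": {"C", "D"} for i in range(500)})
data["doublet"] = {"A", "B", "C", "D"}

reference = build_reference(data)
doublet_groups = split_into_cell_groups(data["doublet"], reference)
print([sorted(g) for g in doublet_groups])
```

In this sketch the "doublet" barcode ends up in two groups (CD and AB), while an ordinary AB barcode stays in a single group, which mirrors why one barcode can show two cellGroups in clones.tsv.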