alexyermanos / Platypus

R package for the analysis of single-cell immune repertoires
GNU General Public License v3.0
36 stars 16 forks source link

VDJ_available not found in all samples #24

Closed Andy-ChanKP closed 1 year ago

Andy-ChanKP commented 1 year ago

Hi again!

Sorry for opening another ticket again - I am wondering why VDJ is not found in the Seurat object for sample 7-9? Even though clearly B cell clusters are present in these samples. Must be something with the VDJ_GEX_matrix function, or my Seurat.in object. Any speculations as to how this error happens would be very welcome and appreciated.

Many thanks, Andy image

image image image
vickreiner commented 1 year ago

Hi Andy,

can you check for me if the VDJ for samples 7-9 are present in vgm_without_13F_LN table(vgm_without_13F_LN[[1]]$sample_id) and, to see if the integration worked from the side of VDJ: vgm_without_13F_LN[[1]] %>% group_by(sample_id) %>% summarise(sum(GEX_available))

Thanks a lot!

Andy-ChanKP commented 1 year ago

Thank you for your prompt reply... Very interesting... So they are present, but the integration did not work.

image
alexyermanos commented 1 year ago

Could you manually check the overlap for s7 GEX and s7 VDJ barcodes in VGM[[2]] and VGM[[1]] respectively and compare this to one that worked (e.g. s6 GEX and s6 VDJ ? e.g. using something like length(intersect())

vickreiner commented 1 year ago

Hi! I just pushed a new version of the VGM which, if verbose = TRUE, prints a pairwise intersect table of all GEX vs. all VDJ. That should highlight any mismatches or wrong orders. Could you try this? Thanks!

Andy-ChanKP commented 1 year ago

Hi Victor and Alex,

Here is the pairwise intersect table... there are certainly many mismatches...

How do I fix this?

Many thanks, Andy

image image image image image image
vickreiner commented 1 year ago

Some low levels of overlap between raw barcodes and samples is to be expected. From experience the numbers in this table are a bit higher than what I would expect.

Apart from that, there are large overlaps between several samples which (without knowing the experiment design behind it) seem out of place.

To me it looks like:

  1. GEX s9 would pair with VDJ s7
  2. GEX s8 would pair with VDJ s9
  3. There is an overlap of GEX s7 and s4; VDJ s4 pairs up with both GEX s7 and s4, while VDJ s7 pairs with s9 (1.)

Not really sure what happened here. For the future I will keep this VDJ_GEX_matrix overlap table in, so that users are made aware of possible mismatches

Andy-ChanKP commented 1 year ago

Hi Victor,

I think your interpretation is accurate - when I do exchange VDJ s7 and s9 with their "correct" GEX counterparts of s9 and s8 respectively, those B cell clusters of s8 and s9 light up. Not too sure why their GEX and VDJ paths are not the same in the first place. Human error mixing them up?

That leaves us s7, and mixing up with s4 and others in different ways.

I will discuss this with our lab members (@mbartl13) and also the core facility, and will let you know how we think it could have happened in the first place. It is so baffling...

Really appreciated your help throughout!! You guys really helped me a lot.

Many thanks, Andy

vickreiner commented 1 year ago

Always happy to help! Hope you can figure out what happened to these samples.