milaboratory / mixcr

MiXCR is an ultimate software platform for analysis of Next-Generation Sequencing (NGS) data for immune profiling.
https://mixcr.com
Other
335 stars 79 forks source link

MiXCR Default Library #1586

Closed bshim181 closed 8 months ago

bshim181 commented 8 months ago

Hello,

I had a question regarding the MiXCR's defaut library reference sequences. I noticed there were some differences in terms of list of alleles for specific v genes and what v genes were included in the reference sequences. Could you elaborate little more on how the default library was built?

Also, if i would want to compare the scTCR output from cell ranger with bulk TCR output from MiXCR to do clonotype expansion tracking over time, how would you recommend synchronizing the reference sequences?

mizraelson commented 8 months ago

1) The MiXCR library is based on genomic information and does not include alleles directly. To infer alleles from the data, there is a dedicated function,mixcr findAlleles . This method is considered more reliable because, in many other databases, hypermutations and sequencing/PCR errors are often mistakenly identified as alleles.

2) I recommend comparing different approaches by mapping the actual VDJ region nucleotide sequence, as the sequence should remain consistent regardless of the allele identification.