milaboratory / mixcr

MiXCR is an ultimate software platform for analysis of Next-Generation Sequencing (NGS) data for immune profiling.
https://mixcr.com
Other
330 stars 79 forks source link

RefineTagsAndSort #1646

Closed bshim181 closed 5 months ago

bshim181 commented 5 months ago

This is an example image and explanation posted on MiXCR website. I could not fully comprehend what the threshold was accomplishing.

Screenshot 2024-05-07 at 3 21 33 PM

In general, during refineTagsAndSort, it seems to correct huge proportion of artificial barcode generated as a artifact of sequencing error.

Screenshot 2024-05-07 at 3 29 22 PM

Previously I assumed that MiXCR discards barcodes with read evidences lower than the threshold, but based on the refineTagsAndSort reports for my samples, that seems to be not the case. Rather it corrects the UMIs and most of the reads are retained. In this case, what is the threshold accomplishing? Are UMIs under that threshold corrected and converged into UMIs over that threshold?

mizraelson commented 5 months ago

MiXCR does both. It first corrects the errors in the UMI sequence and then applies the threshold.