Closed jbloom closed 3 years ago
I think UMI_tools
has methods for figuring out "true" barcodes from mismatches that could be applied to the viral barcodes within single cells.
Within the whole mix, there is some complicated set of viral barcodes. That is true both if we look at all barcodes over transcriptomics (across cells) or if we look at all barcodes in the supernatant.
We really only care about the viral barcodes that are in the transcriptomics.
Furthermore, within each cell, we expect there to be only at most a few distinct viral barcodes.
My suggestion is that first we identify truly infected cells in the transcriptomics.
Then we use the transcriptomic viral barcode calls to use something like UMI_tools to figure out what are the true viral barcode in that cell. And then we have our set of true viral barcodes in the transcriptomic data.
If we take this approach, we need to #58
A few ideas came out of our discussion about this issue today:
This nebulous issue has been superseded by more specific tasks. I am closing it.
@dbacsik says: