Closed kbergin closed 5 years ago
@barkasn @kishorikonwar Tim suggests that this paper may have already benchmarked umi-tools vs alevin and may mean that we don't need to do benchmarking to choose a tool we could just use results from papers that have shown benchmarking. https://www.biorxiv.org/content/early/2018/06/01/335000
Perhaps there are other papers that have benchmarked this algorithm
We have decided to implement umi-tools as a first pass and are communicating with the developers of Alevin for their help using their tool in the future. When they are ready we will put that into Optimus.
Current cell barcode correction methods lose some reads due to them having more than one error. We could implement another method that would allow us to quickly find cell barcode and UMI barcode errors that are greater than hamming distance one.
Suggestions:
CGATOxford/UMI-tools could be another choice, which is written in Python has UMI correction and duplication methods. Ian Sudbery from Sheffield Institute for Nucleic Acids, University of Sheffield, seems to be enthusiastic to point or help us explore this tool.
Ambrose suggested looking into the barcode correction algorithm is described in the supplement of the paper Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types
AC