CGATOxford / UMI-tools

Tools for handling Unique Molecular Identifiers in NGS data sets
MIT License

Collapse barcode to generate whitelist based on frequency #619

Closed ashokpatowary closed 11 months ago

ashokpatowary commented 11 months ago

I have barcode frequencies in the following format from single-cell long-read data. What would be the best way to generate an error-corrected whitelist from this data? @IanSudbery suggested in one post using getCellWhitelist, but I could not get it to work. Could you suggest how I can do this? Thanks.

  13574 ATGCTAACCTACCGTAATC
  12244 ATGCTAACCTAGGCATCTT
   8185 CCTTGGCTCTATGGCGCTT
   8135 ATGCTAACCTAAACGTTGC
   7340 ATGCTAACCTATGACCGGA
---------------------------------
--------------------------------
      1 AAAAAAAAAAACATCGTTC
      1 AAAAAAAAAAACAGCGAAC
      1 AAAAAAAAAAACAGAACGA
      1 AAAAAAAAAAAAGGCTCTA
      1 AAAAAAAAAAAACTACGCA
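
For a count table like the one above, one possible approach can be sketched in plain Python. This is not the UMI-tools implementation and does not call getCellWhitelist. It assumes a hand-picked `min_count` threshold in place of any automatic knee detection, and it only merges a low-count barcode into a whitelisted one when exactly one whitelisted barcode lies within a single substitution. All function names here (`parse_counts`, `hamming_neighbours`, `collapse_barcodes`) are hypothetical.

```python
from collections import Counter


def parse_counts(lines):
    """Parse 'count barcode' lines, e.g. the output of `sort | uniq -c`."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) != 2 or not fields[0].isdigit():
            continue  # skip blank lines and separator rows
        counts[fields[1]] += int(fields[0])
    return counts


def hamming_neighbours(barcode, alphabet="ACGT"):
    """Yield every sequence that differs from barcode by one substitution."""
    for i, base in enumerate(barcode):
        for alt in alphabet:
            if alt != base:
                yield barcode[:i] + alt + barcode[i + 1:]


def collapse_barcodes(counts, min_count):
    """Build a whitelist and fold unambiguous 1-mismatch barcodes into it.

    min_count is an assumed, user-chosen cutoff: barcodes seen at least
    this many times are treated as true cell barcodes.
    """
    whitelist = {bc for bc, n in counts.items() if n >= min_count}
    corrections = {}
    for bc in counts:
        if bc in whitelist:
            continue
        hits = [nb for nb in hamming_neighbours(bc) if nb in whitelist]
        if len(hits) == 1:  # ambiguous or unmatched barcodes are left out
            corrections[bc] = hits[0]
    collapsed = Counter({bc: counts[bc] for bc in whitelist})
    for bc, target in corrections.items():
        collapsed[target] += counts[bc]
    return whitelist, corrections, collapsed


if __name__ == "__main__":
    # "barcode_counts.txt" is a placeholder path for the table shown above.
    with open("barcode_counts.txt") as handle:
        table = parse_counts(handle)
    whitelist, corrections, collapsed = collapse_barcodes(table, min_count=1000)
    for barcode, total in collapsed.most_common():
        print(total, barcode)
```

The cutoff of 1000 is only an illustration; a sensible value depends on where the count distribution drops off in the full table. Barcodes that match no whitelisted sequence, or more than one, are deliberately discarded rather than guessed.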