AfshinLab / BLR

Introduce primary contigs #52

Closed pontushojer closed 3 years ago

pontushojer commented 3 years ago

Contigs are now defined at three levels:

Post-processing steps run only on primary contigs, not on all contigs:

Reasons to add primary contigs include:

FrickTobias commented 3 years ago

Looks really nice. Since the description is so good, I would also copy part of it from the PR into the permanent documentation, docs/develop.rst.

Contigs are now defined at three levels:

  • all = every contig in the reference.
  • primary = every contig in the reference that should go through certain post-processing steps (see below). This is a subset of all.
  • phased = every contig in the reference that is diploid, i.e. can be phased. This is a subset of primary.
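
The nesting of the three levels can be sketched in Python as follows. This is only an illustration: the `ContigLevels` class, its field names and the contig names are made up here and are not taken from the BLR code base.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContigLevels:
    """Hypothetical container for the three contig levels."""

    all_contigs: frozenset
    primary: frozenset
    phased: frozenset

    def __post_init__(self):
        # Enforce the nesting: phased is a subset of primary,
        # and primary is a subset of all.
        if not self.phased <= self.primary:
            raise ValueError("phased contigs must be a subset of primary contigs")
        if not self.primary <= self.all_contigs:
            raise ValueError("primary contigs must be a subset of all contigs")


# Invented contig names, for illustration only.
levels = ContigLevels(
    all_contigs=frozenset({"chr1", "chr2", "chrX", "chrY", "chrM", "chrUn_1"}),
    primary=frozenset({"chr1", "chr2", "chrX", "chrY"}),
    phased=frozenset({"chr1", "chr2"}),
)
```

Constructing a `ContigLevels` with a phased contig that is not also primary raises a `ValueError`, which makes the subset relationship explicit.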

Post-processing steps run only on primary contigs, not on all contigs:

  • find_clusterdups + get_barcode_merges
  • concat_molecule_stats + get_barcodes_to_filter
  • call_variants
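
As a rough sketch of how the steps listed above could be restricted to primary contigs, assuming a simple lookup by step name. Only the step names come from the list; the `PRIMARY_ONLY_STEPS` set, the `contigs_for_step` helper, the fallback to all contigs for unlisted steps and the contig names are hypothetical and do not describe how BLR actually implements this.

```python
# Steps that, per the list above, run on primary contigs only.
PRIMARY_ONLY_STEPS = {
    "find_clusterdups",
    "get_barcode_merges",
    "concat_molecule_stats",
    "get_barcodes_to_filter",
    "call_variants",
}


def contigs_for_step(step, all_contigs, primary_contigs):
    """Return the contigs a step should process (hypothetical helper).

    all_contigs is an ordered list of contig names from the reference;
    primary_contigs is a set of names that is a subset of all_contigs.
    """
    if step in PRIMARY_ONLY_STEPS:
        # Keep the reference order, drop non-primary contigs.
        return [c for c in all_contigs if c in primary_contigs]
    # Assumption: any step not listed above runs on every contig.
    return list(all_contigs)


# Invented contig names, for illustration only.
reference = ["chr1", "chrM", "chr2"]
primary = {"chr1", "chr2"}
variant_contigs = contigs_for_step("call_variants", reference, primary)
```

With these inputs, `variant_contigs` contains `chr1` and `chr2` but not `chrM`, while a step outside the set would receive all three contigs.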

pontushojer commented 3 years ago

@FrickTobias Thanks for the input! I have added a new section about the pipeline to develop.rst, which contains this info.