MontgomeryLab / tinyRNA

tinyRNA provides an all-in-one solution for precision analysis of sRNA-seq data. At the core of tinyRNA is a highly flexible counting utility, tiny-count, that allows for hierarchical assignment of reads to features based on positional information, extent of feature overlap, 5’ nucleotide, length, and strandedness.
GNU General Public License v3.0
1 stars 1 forks source link

tiny-count: support a sequence-based counting mode when GFF files aren't provided #277

Closed AlexTate closed 1 year ago

AlexTate commented 1 year ago

The pipeline currently requires the user to specify at least one reference GFF. However, there may be instances where they don't have or want to use genomic/reference annotations (.fa/.gff). For example, instead of a genomic fasta file they may instead want to use a synthetic fasta with known sequences to count against.

The behavior of tiny-count will be changed to allow for this. If a user does not specify any GFF files:

AlexTate commented 1 year ago

Reopening this issue to add support for the Classify as... values in this mode. They're illustrated as being part of Stage 1 selection so I didn't include them in order to avoid confusion and be consistent with "Stage 1 selection is skipped." With the current explanation of "these reference sequences behave like features that had matched every rule in Stage 1," I think we could include them without adding too much confusion.