dieterich-lab / scimodom

GNU Affero General Public License v3.0
0 stars 0 forks source link

RNA types #97

Open eboileau opened 1 month ago

eboileau commented 1 month ago

Aims/objectives.

The nomenclature for RNA types (table rna_type) is fixed e.g.

+---------+-------------------+
| id      | name              |
+---------+-------------------+
| circRNA | circular RNA      |
| mRNA    | messenger RNA     |
| miRNA   | micro RNA         |
| rRNA    | ribosomal RNA     |
| snRNA   | small nuclear RNA |
| tRNA    | transfer RNA      |
+---------+-------------------+

This means that anyone can eventually upload modification data associated with any of these RNA types. According to the specification,

a given dataset or bedRMod file can contain more than one modification, as reported in column 4 (MODOMICS short name), but this should be for the same RNA type.

otherwise it becomes practically impossible to search records, at least the way the model is designed.

A clear and concise description of todo items.

eboileau commented 4 weeks ago

The terminology is now fixed to

+------+---------------------+
| id   | name                |
+------+---------------------+
| tRNA | transfer RNA        |
| WTS  | whole transcriptome |
+------+---------------------+

It is built around the concept of sequencing method rather than the biological definition of RNA species. WTS thus stands for mRNA, ncRNAs, etc.

I started to work on integrating tRNAs...

We can eventually add rRNA, but unless I'm mistaken, this would not require changes, as they will be annotated with Ensembl and be handled as WTS.