RNAcentral / rnacentral-import-pipeline

RNAcentral data import pipeline
Apache License 2.0
2 stars 0 forks source link

Import REDIportal data #169

Open blakesweeney opened 1 year ago

blakesweeney commented 1 year ago

REDIportal provides information on the locations of modified RNA nts in the human genome. We can think of their data as a BED file of locations and some metadata about the type of modifications. This will be an expert database that does not provide any sequences, but we do have one like this, CRS. To import their data we need to:

We also need to provide a search export that makes it possible to find all sequences with these editing events and possibly search them. Maybe adding terms like:

I'm not sure the second term is idea, so better suggestions are encourged.

Finally, we need to provide them a linkage between editing even and URS_taxid. I think a file like: