RNAcentral / rnacentral-import-pipeline

RNAcentral data import pipeline
Apache License 2.0
2 stars 1 forks source link

Ensure precompute uses all expected xrefs #165

Open blakesweeney opened 1 year ago

blakesweeney commented 1 year ago

A recent help request delt with some sequenes which appeared to be active but cannot be found. Looking into it found this issue. This was caused by a bug I created at some point which imported RNA sequences instead of DNA ones. I fixed it but the fact that those sequences are marked as active in rnc_rna_precompute means there is another bug. Basically, at some point the precompute pipeline should have pulled out xrefs for those UPIs and ensured that rnc_rna_precompte.is_active is false. As that didn't happen there is likely a bug somewhere. One possible solution is to just manually fix these entries, but I'm not sure how big the bug is.

Some things that may be useful, but I'm not sure are required: