RNAcentral / rnacentral-import-pipeline

RNAcentral data import pipeline
Apache License 2.0
Write URL to its own column #185

Closed blakesweeney closed 9 months ago

blakesweeney commented 9 months ago

This change writes the URL, if any, to the URL column, which is then loaded and put into rncaccessions. At the moment that column does not exist in the load or final table, so it needs to be added before this is merged. Additionally, this keeps writing the URL in the note field so as not to break anything for the upcoming release.
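As a rough illustration of the dual write described above, here is a minimal sketch in Python. The function names, the column order, and the JSON-encoded note are all assumptions made for the example, not the pipeline's actual writer.

```python
import csv
import io
import json


def write_accession_row(writer, accession, note, url=None):
    """Write one row with the URL in its own column (hypothetical layout)."""
    note_data = dict(note)
    if url:
        # Still duplicate the URL into the note so existing consumers keep working.
        note_data["url"] = url
    # An empty string stands in for "no URL"; a loader could map it to NULL.
    writer.writerow([accession, json.dumps(note_data, sort_keys=True), url or ""])


def rows_as_csv(entries):
    """Serialize (accession, note, url) tuples to CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for accession, note, url in entries:
        write_accession_row(writer, accession, note, url)
    return buffer.getvalue()
```

Calling `rows_as_csv([("A1", {}, "http://example.org/A1"), ("A2", {}, None)])` would give one row with the URL in both the note and the last column, and one row with an empty last column.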

Once this is merged, we need to add the column, defaulting to NULL, to accessions and the load_ table. Then the update function needs to be changed to copy the new column. We should then test using this column in place of the current way URLs for accessions are calculated. That can wait until after this release. Future work should move the schema for these tables and the function into a controlled location, be it here or the web repo.
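The follow-up steps could look roughly like the sketch below. It uses an in-memory SQLite database purely so it can run anywhere; the table names `accessions` and `load_accessions`, the `url` column name, and the `update_accessions` function are hypothetical stand-ins, and the real database and update function will differ.

```python
import sqlite3

# Hypothetical table and column names, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE accessions (accession TEXT PRIMARY KEY, note TEXT);
    CREATE TABLE load_accessions (accession TEXT PRIMARY KEY, note TEXT);
    INSERT INTO accessions VALUES ('A1', '{}');

    -- Step 1: add the column to both tables, defaulting to NULL.
    ALTER TABLE accessions ADD COLUMN url TEXT DEFAULT NULL;
    ALTER TABLE load_accessions ADD COLUMN url TEXT DEFAULT NULL;
    """
)


def update_accessions(conn):
    """Step 2: stand-in for the update function, now copying url as well."""
    conn.execute(
        """
        INSERT OR REPLACE INTO accessions (accession, note, url)
        SELECT accession, note, url FROM load_accessions
        """
    )
    conn.commit()


# Load one new row that carries a URL, then run the update.
conn.execute(
    "INSERT INTO load_accessions VALUES (?, ?, ?)",
    ("A2", "{}", "http://example.org/A2"),
)
update_accessions(conn)
rows = conn.execute(
    "SELECT accession, url FROM accessions ORDER BY accession"
).fetchall()
```

Rows that existed before the column was added keep a NULL `url`, which is why the default matters: nothing has to be backfilled before the release.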