This is the start of work to clean up the columns in rnc_accessions. This pull request stops writing and loading specific columns which do not appear to be useful. It should be safe to just run, but as far as I can tell the tests are broken right now. Before running this pipeline, we must manually edit the rnc_update.update_rnc_accession function in the database to remove references to the columns cleaned up here. Otherwise it will break when loading data into the database. For reference, the columns are:
map
allele
ordinal
pseudogene
old_locus_tag
anticodon
division
common_name
classification
species
operon
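As a rough aid for the manual edit, the sketch below checks whether a function body still mentions any of the columns listed above. It is only an illustration: `remaining_references` is a hypothetical helper, not part of the pipeline, and it assumes you have already copied the source of rnc_update.update_rnc_accession into a string (on PostgreSQL, `pg_get_functiondef` is one way to get it). It matches whole words only, so common words like `map` or `species` may produce false positives in comments.

```python
import re
from typing import List

# Columns this pull request stops writing and loading (taken from the list above).
REMOVED_COLUMNS = [
    "map", "allele", "ordinal", "pseudogene", "old_locus_tag", "anticodon",
    "division", "common_name", "classification", "species", "operon",
]


def remaining_references(function_source: str) -> List[str]:
    """Return the removed column names that still appear as whole words.

    Hypothetical helper: `function_source` is assumed to be the text of the
    database function, pasted or fetched by the caller.
    """
    found = []
    for column in REMOVED_COLUMNS:
        # \b treats "_" as a word character, so "map" does not match "map_id".
        if re.search(rf"\b{re.escape(column)}\b", function_source, re.IGNORECASE):
            found.append(column)
    return found
```

An empty result suggests the function no longer refers to the removed columns, but it is worth reading the function once more by hand before running the pipeline.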
The taxonomic columns are not needed, but they are left in the entry object for now. Removing them needs more careful work, as we do use them for some logic within the pipeline.
I've been working through the database tests and have fixed some of them. Fixing the rest is going to take me a while. I can start tracking it, but I suspect it will be very slow going.