Closed cybersiddhu closed 9 years ago
- `feature` table.
- `uniquename` column will hold the unstable identifier, equivalent to the GID or PID of GenBank, which changes whenever the sequence changes.
- `accession` column of the `dbxref` table will hold the stable identifier (accession no.), and the `version` column will hold the version number.
- `dbxref_id` column in the `feature` table.
- `uniquename` column. That `uniquename` value will be the new unstable identifier for the new sequence entry.
- entry by copying the `accession` value from the previous one but increasing the value of the `version` column (+1).
- `dbxref` entry by copying the `uniquename` value and adding `1` to the `version` column.
- `uniquename` column with the value of the primary key. A UUID value would be another alternative. It's beyond the scope of this issue and should be handled by the middleware.
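One possible reading of the steps above, sketched with Python's built-in `sqlite3` module. The two tables are heavily simplified stand-ins that keep only the columns named in this issue plus a `residues` column for the sequence; the integer `version`, the UUID-based `uniquename`, and the helper names `create_feature` and `update_sequence` are assumptions made for illustration, not the actual schema or middleware.

```python
import sqlite3
import uuid

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dbxref (
    dbxref_id INTEGER PRIMARY KEY,
    accession TEXT NOT NULL,      -- stable identifier
    version   INTEGER NOT NULL,   -- assumed integer for this sketch
    UNIQUE (accession, version)
);
CREATE TABLE feature (
    feature_id INTEGER PRIMARY KEY,
    uniquename TEXT NOT NULL UNIQUE,  -- unstable identifier
    residues   TEXT NOT NULL,
    dbxref_id  INTEGER NOT NULL REFERENCES dbxref (dbxref_id)
);
""")


def _insert_feature(residues, dbxref_id):
    # Assumption: a UUID serves as the unstable identifier.
    conn.execute(
        "INSERT INTO feature (uniquename, residues, dbxref_id) VALUES (?, ?, ?)",
        (uuid.uuid4().hex, residues, dbxref_id),
    )


def create_feature(accession, residues):
    """Store a brand-new feature; its stable identifier starts at version 1."""
    cur = conn.execute(
        "INSERT INTO dbxref (accession, version) VALUES (?, 1)", (accession,)
    )
    _insert_feature(residues, cur.lastrowid)


def update_sequence(accession, residues):
    """Add a new feature row and a dbxref row with the version bumped by 1."""
    (latest,) = conn.execute(
        "SELECT MAX(version) FROM dbxref WHERE accession = ?", (accession,)
    ).fetchone()
    if latest is None:
        raise KeyError(accession)
    cur = conn.execute(
        "INSERT INTO dbxref (accession, version) VALUES (?, ?)",
        (accession, latest + 1),
    )
    _insert_feature(residues, cur.lastrowid)
    return latest + 1
```

The earlier rows are never updated or deleted in this sketch, so every previous sequence stays queryable through its own `uniquename`.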
Note: This implementation does not apply the versioning schema to the existing sequencing center and curated gene models.
A versioning model will be applied to the majority of the sequence features. The idea is borrowed primarily from GenBank. Every feature will have a sequence id (internal, akin to the GID or PID in GenBank) and a stable identifier (the accession no. in GenBank). The stable identifier always starts at version 1. Any change in the feature sequence will create a new feature entry with a new sequence id, whereas the stable identifier remains intact and its version number is incremented (1 becomes 2). In other words, all features with identical stable identifiers will differ in their version, and the one with the highest version will be the canonical one. The history of a feature's sequence changes will also be preserved.
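The model described above can be illustrated with a small in-memory sketch. The names `FeatureVersion` and `FeatureStore` are hypothetical, and treating an unchanged sequence as "no new version" is an assumption drawn from the wording "any change in feature sequence".

```python
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class FeatureVersion:
    sequence_id: int   # internal, unstable id (akin to GID/PID)
    accession: str     # stable identifier
    version: int       # starts at 1
    residues: str


class FeatureStore:
    def __init__(self):
        self._next_id = count(1)
        self._history = {}  # accession -> list of FeatureVersion, oldest first

    def save(self, accession, residues):
        """Record a sequence; a changed sequence yields a new entry and version."""
        history = self._history.setdefault(accession, [])
        if history and history[-1].residues == residues:
            return history[-1]  # assumption: no change, no new version
        entry = FeatureVersion(
            sequence_id=next(self._next_id),
            accession=accession,
            version=len(history) + 1,
            residues=residues,
        )
        history.append(entry)
        return entry

    def canonical(self, accession):
        """The entry with the highest version is the canonical one."""
        return max(self._history[accession], key=lambda f: f.version)

    def history(self, accession):
        """All preserved versions, oldest first."""
        return list(self._history[accession])
```

Because older entries are kept rather than overwritten, `history()` returns the full record of sequence changes for a given stable identifier.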
Tied to https://github.com/dictyBase/Migration/issues/5