dictyBase / Modware-Loader

Various data munging and loading scripts for genome database

Versioning for entries with sequence and annotation #150

Closed cybersiddhu closed 9 years ago

cybersiddhu commented 9 years ago

A versioning model will be applied to the majority of sequence features. The idea is borrowed primarily from GenBank. Every feature will have a sequence id (internal, akin to the GID or PID in GenBank) and a stable identifier (the accession number in GenBank). The stable identifier always starts at version 1. Any change in the feature sequence creates a new feature entry with a new sequence id, while the stable identifier remains intact and its version number is incremented (1 becomes 2). In other words, all features with identical stable identifiers differ in their version, and the one with the highest version is the canonical one. The history of sequence changes for each feature will also be preserved.
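To make the rules above concrete, here is a minimal in-memory sketch in Python. It is not the Chado or Modware-Loader implementation; the names `FeatureVersion`, `FeatureRegistry`, `create`, `update_sequence`, `canonical`, and `history`, as well as the identifier `FEAT0001`, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from itertools import count
from typing import Dict, List


@dataclass(frozen=True)
class FeatureVersion:
    """One versioned entry of a feature (hypothetical structure)."""
    sequence_id: int   # internal id, a new one for every sequence change
    stable_id: str     # accession-like identifier, never changes
    version: int       # starts at 1, incremented on each sequence change
    sequence: str


class FeatureRegistry:
    """Illustrative in-memory store; a real system would use the database."""

    def __init__(self) -> None:
        self._next_sequence_id = count(1)
        self._history: Dict[str, List[FeatureVersion]] = {}

    def create(self, stable_id: str, sequence: str) -> FeatureVersion:
        # A new stable identifier always starts at version 1.
        if stable_id in self._history:
            raise ValueError(f"{stable_id} already exists")
        entry = FeatureVersion(next(self._next_sequence_id), stable_id, 1, sequence)
        self._history[stable_id] = [entry]
        return entry

    def update_sequence(self, stable_id: str, sequence: str) -> FeatureVersion:
        current = self.canonical(stable_id)
        if sequence == current.sequence:
            # No sequence change, so no new version is created.
            return current
        # New entry: new sequence id, same stable id, version + 1.
        entry = FeatureVersion(
            next(self._next_sequence_id), stable_id, current.version + 1, sequence
        )
        self._history[stable_id].append(entry)
        return entry

    def canonical(self, stable_id: str) -> FeatureVersion:
        # The entry with the highest version is the canonical one.
        return max(self._history[stable_id], key=lambda e: e.version)

    def history(self, stable_id: str) -> List[FeatureVersion]:
        # Older versions are preserved, ordered from oldest to newest.
        return list(self._history[stable_id])


registry = FeatureRegistry()
first = registry.create("FEAT0001", "ATGC")            # hypothetical identifier
second = registry.update_sequence("FEAT0001", "ATGG")  # sequence changed
```

In this sketch, `second` shares the stable identifier of `first` but carries a new sequence id and version 2, and `registry.history("FEAT0001")` still returns both entries.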

Tied to https://github.com/dictyBase/Migration/issues/5

cybersiddhu commented 9 years ago

Chado database implementation

Data model

Managing versions

cybersiddhu commented 9 years ago

Software implementation

Bootstrap the data model

Managing versions

It's beyond the scope of this issue and should be handled by the middleware.

Note: This implementation does not apply the versioning schema to the existing sequencing center and curated gene models.