ga4gh / vrs

Extensible specification for representing and uniquely identifying biological sequence variation
https://vrs.ga4gh.org
Apache License 2.0
80 stars 32 forks source link

Molecular type attribute #179

Closed ahwagner closed 2 weeks ago

ahwagner commented 4 years ago

Sequences are currently untyped, but may benefit from DNA/RNA/AA designation. This was an early design decision I think we should revisit. Starting this thread as a place to collect use cases and issues for making such a change.

rrfreimuth commented 4 years ago

+1 to the idea, but details are TBD.

Is the intent for this to restrict the alphabet used to represent the sequence? Or is it to facilitate consumption or processing of the sequence? I suspect the latter, but we need to clarify this.

My initial thoughts are a single attribute to capture type, bound to an ontology that allows the author to subtype as needed (e.g., mRNA vs. RNA). That approach provides flexibility but may complicate digests as a result. An alternative is to bind a simple value set containing the main molecular types (DNA, RNA, protein).

github-actions[bot] commented 4 years ago

Stale issue message

ahwagner commented 3 years ago

To be considered for v2