Open nsheff opened 1 year ago
The latest minimal schema has been updated to this:
description: "A collection of biological sequences."
type: object
properties:
  lengths:
    type: array
    collated: true
    description: "Number of elements, such as nucleotides or amino acids, in each sequence."
    items:
      type: integer
  names:
    type: array
    collated: true
    description: "Human-readable labels of each sequence (chromosome names)."
    items:
      type: string
  sequences:
    type: array
    collated: true
    items:
      type: string
    description: "Refget sequences v2 identifiers for sequences."
  accessions:
    type: array
    collated: true
    items:
      type: string
    description: "Unique external accessions for the sequences"
required:
  - names
  - lengths
  - sequences
ga4gh:
  inherent:
    - names
    - sequences
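To make the schema above more concrete, here is a rough Python sketch of how an implementation might check a sequence collection against it. Everything named here (`SCHEMA`, `check_seqcol`, `inherent_subset`, and the placeholder sequence identifiers) is hypothetical and not part of any existing seqcol library. The sketch also assumes that `collated: true` means the arrays line up one-to-one and so must be the same length, and that `inherent` lists the attributes that define a collection's identity; the schema itself does not spell out either point.

```python
# Hypothetical in-memory copy of the parts of the schema used below.
SCHEMA = {
    "properties": {
        "lengths": {"type": "array", "collated": True, "items": {"type": "integer"}},
        "names": {"type": "array", "collated": True, "items": {"type": "string"}},
        "sequences": {"type": "array", "collated": True, "items": {"type": "string"}},
        "accessions": {"type": "array", "collated": True, "items": {"type": "string"}},
    },
    "required": ["names", "lengths", "sequences"],
    "ga4gh": {"inherent": ["names", "sequences"]},
}

# Map the schema's item type names onto Python types.
PY_TYPES = {"integer": int, "string": str}


def check_seqcol(seqcol, schema=SCHEMA):
    """Return a list of problems; an empty list means these checks passed."""
    problems = []

    # Every attribute listed under `required` must be present.
    for attr in schema["required"]:
        if attr not in seqcol:
            problems.append(f"missing required attribute: {attr}")

    collated_lengths = set()
    for attr, value in seqcol.items():
        spec = schema["properties"].get(attr)
        if spec is None:
            continue  # attributes not in the schema are ignored in this sketch
        if not isinstance(value, list):
            problems.append(f"{attr}: expected an array")
            continue
        item_type = PY_TYPES[spec["items"]["type"]]
        # bool is a subclass of int in Python, so exclude it explicitly.
        if not all(isinstance(v, item_type) and not isinstance(v, bool) for v in value):
            problems.append(f"{attr}: items must be of type {spec['items']['type']}")
        if spec.get("collated"):
            collated_lengths.add(len(value))

    # Assumption: collated arrays are aligned element by element.
    if len(collated_lengths) > 1:
        problems.append("collated attributes differ in length")
    return problems


def inherent_subset(seqcol, schema=SCHEMA):
    """Keep only the attributes listed under ga4gh.inherent."""
    return {k: seqcol[k] for k in schema["ga4gh"]["inherent"] if k in seqcol}


# Made-up example; the sequence identifiers are placeholders, not real digests.
example = {
    "names": ["chr1", "chr2"],
    "lengths": [100, 200],
    "sequences": ["SQ.placeholder1", "SQ.placeholder2"],
}

if __name__ == "__main__":
    print(check_seqcol(example))
    print(inherent_subset(example))
```

A real implementation would more likely load the YAML directly and use a JSON Schema validator for the standard keywords, handling only `collated` and `ga4gh.inherent` by hand, since those are not standard JSON Schema keywords.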
We decided to start with two schemas: a minimal schema, which we would post now as the one to implement, and an extended schema, which is in the evaluation stage to see whether its contents should end up in the minimal schema. Here are drafts of both for comment and revision:
Minimal seqcol schema
Extended seqcol schema