cov-lineages / constellations

Other
43 stars 17 forks source link

constellations

This repository contains descriptions of constellations of mutations for the SARS-CoV-2 virus.

A constellation is a collection of mutations which are functionally meaningful, but which may arise independently a number of times.

Here we include files that define:

In addition we include a JSON file describing the reference sequence and the coordinates of genes/proteins. Mutations can therefore be described with respect to the amino acid position within these features.

Definitions

At a minimum, JSON files must contain the following:

The general format of a mutation code is: gene:[ref]coordinates[alt] where gene is a gene code (or nuc for the genomic nucleotide sequence), ref is the nucleotide or amino acids in the reference, alt is the specific nucleotide or amino acid for the mutatant. Either of ref or alt can be missing if no specific state is required. See https://github.com/cov-lineages/scorpio for more definitions.

Rules can either specify [min|max]_[ref|alt|ambig|oth] OR the call required at a mutation e.g. "N:S235F": (not )[ref|alt|ambig|oth]

Optional fields