Closed schristley closed 1 year ago
The germline set should contain one AlleleDescription for each allele. Mention this in the description.
The rearrangement calls should reflect the label field in the AlleleDescription.
The version number is updated under the control of the repository hosting the germline set. Reflect the discussion in the AIRR standards documentation: if someone copies a germline set and changes it, they should put their own version number on it.
Add validation code to the R and Python libraries to detect more than one definition of an allele.
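A minimal sketch of the duplicate-allele check described above, in plain Python. It is not the AIRR library's implementation: the function name `find_duplicate_alleles` is hypothetical, and it assumes the germline set is a dict whose `allele_descriptions` list holds one dict per AlleleDescription, each identified by its `label` field.

```python
from collections import Counter


def find_duplicate_alleles(germline_set, key="label"):
    """Return the allele names that have more than one AlleleDescription.

    Assumption: `germline_set` is a dict with an `allele_descriptions` list,
    and each entry is a dict carrying the allele name under `key`.
    """
    descriptions = germline_set.get("allele_descriptions", [])
    counts = Counter(d.get(key) for d in descriptions)
    # Entries with no name at all are ignored here; a stricter validator
    # might report them separately.
    return sorted(name for name, n in counts.items() if name is not None and n > 1)


# Illustrative data only, not taken from a real germline set.
example_set = {
    "allele_descriptions": [
        {"label": "IGHV1-69*01"},
        {"label": "IGHV1-69*02"},
        {"label": "IGHV1-69*01"},
    ],
}
duplicates = find_duplicate_alleles(example_set)
```

A validator built on this would reject the set, or at least warn, whenever the returned list is non-empty.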
(closed in error)
Closed now that the documentation is updated.
The AlleleDescription schema supports a release_version, which is important as curation/updating of germline genes will be an ongoing process for many species. However, some open questions remain regarding how to manage that provenance, and what that implies for AIRR objects that may reference or use the germline genes. Here are some questions that should be resolved and documented:
v_call, d_call, j_call? I believe the answer is gene_symbol except in the case when it is empty (hasn't been assigned yet), and then coding_sequence_identifier should be used.
Depending upon the answers, we might add validation rules to enforce our decisions.
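The fallback proposed in this issue (use gene_symbol, and fall back to coding_sequence_identifier when it is empty) could be sketched as follows. The helper name `call_identifier` is hypothetical, the allele description is assumed to be a plain dict, and the field names are the ones used in this issue text, which may differ from the released schema.

```python
def call_identifier(allele_description):
    """Pick the identifier to use in v_call, d_call or j_call.

    Assumption: `allele_description` is a dict using the field names from
    this issue. A missing, None, or empty gene_symbol counts as
    "not assigned yet".
    """
    symbol = allele_description.get("gene_symbol")
    if symbol:
        return symbol
    # Fall back to the sequence identifier when no symbol has been assigned.
    return allele_description.get("coding_sequence_identifier")


# Illustrative values only.
named = {"gene_symbol": "IGHV1-69*01", "coding_sequence_identifier": "seq-0001"}
unnamed = {"gene_symbol": "", "coding_sequence_identifier": "seq-0002"}
named_call = call_identifier(named)
unnamed_call = call_identifier(unnamed)
```

A validation rule could then require that every call in a rearrangement matches the value this function returns for some allele in the referenced germline set.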