Closed sooyounga closed 1 year ago
let's add a slot in our model for biotype
called molecular type
and lets' make the range of that slot be an enum with two values: protein-coding
and noncoding
.
also once the other csv mapping is done, let's use that to add values to the in_taxon
and referenced in
slots.
for the other csvs, we want to collate and map that information into GenomeAnnotation
, whose id
would then by used for the value of referenced in
.
is this rebased on current master? seems like a lot of unrelated files have been updated.
based on provided small data set of 20230412, the columns are mapped to these attributes in our model:
synonym
id
prefix (stylized)id
suffixsymbol
name
translator.py is first draft of how we can take such data in csv format and load it into yaml data format to load into our model class