Closed newgene closed 2 years ago
The `Symbol_from_nomenclature_authority` field can hold the empty value `-`. We should also verify that the value does not already exist in `other_names`.
So I did a tally: out of a total of about 35 million, about 120 have different values, 374k have the same value, and the others have "-" as the value.
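A tally like this could be reproduced with a short script. The sketch below is only an illustration, not the script that produced the numbers above: the `tally_symbols` helper and the sample rows are made up, and it assumes the input is tab-separated with `Symbol` and `Symbol_from_nomenclature_authority` columns in the header.

```python
import csv
import io
from collections import Counter


def tally_symbols(lines):
    """Count rows by how Symbol compares with Symbol_from_nomenclature_authority.

    `lines` is any iterable of tab-separated text lines whose first line is a
    header naming the columns (hypothetical helper, not part of GeneInfoParser).
    """
    reader = csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    counts = Counter()
    for row in reader:
        symbol = row["Symbol"]
        authority = row["Symbol_from_nomenclature_authority"]
        if authority == "-":
            counts["empty"] += 1      # no authority symbol given
        elif authority == symbol:
            counts["same"] += 1       # both columns agree
        else:
            counts["different"] += 1  # the case this issue is about
    return counts


# Made-up rows for illustration only; real input would be the gene_info.gz content.
sample = io.StringIO(
    "GeneID\tSymbol\tSymbol_from_nomenclature_authority\n"
    "101\tGENE_A\tGENE_A\n"
    "102\tGENE_B\tGENE_B2\n"
    "103\tGENE_C\t-\n"
)
sample_counts = tally_symbols(sample)
print(dict(sample_counts))
```

For the real file, the same function could be fed a text-mode handle from `gzip.open(path, "rt")`, after adjusting for however the header line is written in that file.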
For `GeneInfoParser` here, we currently take the `Symbol` column as the official `symbol` value in the MyGene.info gene object. There is also a `Symbol_from_nomenclature_authority` field in the `gene_info.gz` file. Most of the time the two should be the same, or there is no `Symbol_from_nomenclature_authority` value, but in some cases the two "symbol" values are different. Users will not be able to search genes via the `Symbol_from_nomenclature_authority` value. Here are two examples:

Even though these two "symbol" values should eventually match in the source data, we can still add the `Symbol_from_nomenclature_authority` value to the existing `other_names` field, so that users can still query for genes with these symbols.
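A minimal sketch of that merge, combining the checks discussed in this thread (skip the `-` placeholder, skip values equal to `symbol`, skip values already in `other_names`). The `add_authority_symbol` function and the shape of the gene document are assumptions made for illustration; this is not the actual `GeneInfoParser` code.

```python
def add_authority_symbol(doc, authority_symbol):
    """Append the nomenclature-authority symbol to doc["other_names"] when useful.

    `doc` is assumed to be a dict with a "symbol" key and an optional
    "other_names" key holding either a string or a list of strings.
    """
    # "-" (or an empty string) means the source has no authority symbol.
    if not authority_symbol or authority_symbol == "-":
        return doc
    # Nothing to add when it already matches the official symbol.
    if authority_symbol == doc.get("symbol"):
        return doc
    other_names = doc.get("other_names", [])
    if isinstance(other_names, str):
        other_names = [other_names]
    # Avoid duplicates if the value is already searchable via other_names.
    if authority_symbol not in other_names:
        doc["other_names"] = other_names + [authority_symbol]
    return doc


# Made-up gene documents for illustration only.
differs = add_authority_symbol({"symbol": "GENE_B", "other_names": ["protein b"]}, "GENE_B2")
placeholder = add_authority_symbol({"symbol": "GENE_C"}, "-")
same = add_authority_symbol({"symbol": "GENE_A"}, "GENE_A")
duplicate = add_authority_symbol({"symbol": "GENE_D", "other_names": "GENE_D2"}, "GENE_D2")
```

Only the first call changes its document; the other three return their input unchanged, so the extra value is indexed only when it adds a new search term.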