biothings / mygene.info

MyGene.info: A BioThings API for gene annotations
http://mygene.info
Other
115 stars 20 forks source link

Ensembl Sources #59

Closed namespacestd0 closed 5 years ago

namespacestd0 commented 5 years ago

Adds:

Changes:

More on the species list file:

Inspection shows the old species.txt have 163 records, BioMart dropdown list has 138 options, dataset_names.txt has 138 records (match dropdown list). There are 25 records(species) in species.txt that are not in dataset_names.txt. Of which, 15 of them are musmusculus*, 1 "Test" record, these are safe to exclude. The remaining 9 records include 4 records that are variations of another record, with the same taxid.

The following speces are just not present with the same taxid in species.txt:

These species are not present in the BioMart dropdown list. dataset_names.txt does not exist for the other Ensembl databases besides the main one. species.txt exisits for all databsses and matches the BioMart dropdown list. The findings above should justify this change.