The Atomspace of the gene annotation service (MOZI-AI) integrates the follwoing list of biology datasets (of Homo sapiens species).
The Biogrid: BioGRID is a repository for Interaction Datasets.
Reactome pathway: The complete list of pathways and hierarchial relationship among them.
The three Physical Entity (PE) Identifier mapping files Physical Entity (PE) Identifier mapping.py imports the following
Small molecule Pathway database (SMPDB)
The Metabolite names linked to SMPDB pathways and Protein names linked to SMPDB pathways
Gene ontology database
The Genes and their ontology GO (classes used to describe gene function and relationships betweeen these classes)
STRING Protein-Protein Interaction Networks Functional Enrichment Analysis
The imported Atomese version of the datasets can be found https://mozi.ai/datasets/
NOTE: For expermenting only on a gene-level, the following scripts generates only gene_level and reduced size (without extra information like name, pubmedId ...) version of the data
Atomese format description with links to source datasets https://docs.google.com/document/d/16zfY7OZtHO66mfujLdZ0-3VALXUTvxeeo4dW2ASBiNs