RobokopU24 / NewSourceProposals

New Knowledge Providers (KPs) for the Data Management Oversight Group (DMOG) to review
0 stars 0 forks source link

Gene Expression Omnibus (GEO) #4

Open eKathleenCarter opened 8 months ago

eKathleenCarter commented 8 months ago

In an effort to remove Hetionet as a KP (#117) but preserve the data there within, this KP is recommended for addition.

GEO is a public functional genomics data repository supporting microarray data run by NCBI. Hetionet used this data set to inform its metaedges: Disease - upregulates - gene (DuG) Disease - downregulates - gene (DdG) There is an R package to help with the ingestion of data https://www.ncbi.nlm.nih.gov/geo/info/geo2r.html Data sets can also be accessed using Entrez Programming Utilities (E-Utils)

Hetionet focused on 48 diseases in their Search Tag Analysis Resource Gene Expression Omnibus (STARGEO) data set which can be found here: https://zenodo.org/records/46866

I suggest we expand that list.