NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0
0 stars 1 forks source link

[Source]: BioStudies ArrayExpress #140

Open gtsueng opened 6 months ago

gtsueng commented 6 months ago

Source Name

BioStudies ArrayExpress

Source URL

https://www.ebi.ac.uk/biostudies/arrayexpress

Source Description

The functional genomics data collection (ArrayExpress), stores data from high-throughput functional genomics experiments, and provides data for reuse to the research community. In line with community guidelines, a study typically contains metadata such as detailed sample annotations, protocols, processed data and raw data. Raw sequence reads from high-throughput sequencing studies are brokered to the European Nucleotide Archive (ENA), and links are provided to download the sequence reads from ENA. Data can be submitted to the ArrayExpress collection through its dedicated submission tool, Annotare.

Short description

ArrayExpress is an NIH supported repository that includes high-throughput genomics data in the biomedical domain.

Source Access

No access issue, account not needed

Source Funding

EMBL

Source Relevance

NIAID medium priority resource

Related WBS task

For internal use only. Assignee, please select the status of this issue

Status Description

No response

Source to-do list

gtsueng commented 5 months ago

This repository has been evaluated as a medium-priority repository for integration. With Dylan's work on MassIVE and MalariaGEN, Jason's work on VEuPathDB collections, and my manual curation to create ResourceCatalogs, we've pretty much covered all of the high-priority resources, and can now start on the medium priority ones.

Note that records from BioStudies-ArrayExpress may potentially be aggregated by OMICs-DI, we should keep an eye out for duplication of records.