pepkit / geofetch

Builds a PEP from SRA or GEO accessions
https://pep.databio.org/geofetch/
BSD 2-Clause "Simplified" License
45 stars 5 forks source link
download-manager geo metadata sra sra-data

geofetch logo

PEP compatible Run pytests docs-badge pypi-badge Code style: black install with bioconda

geofetch is a command-line tool that downloads sequencing data and metadata from GEO and SRA and create metadata tables in standard PEP format. geofetch is hosted at pypi. You can convert the result of geofetch into unmapped bam or fastq files with the included sraconvert command.

Key geofetch features:

Docs


Documentation: https://pep.databio.org/geofetch/

Source Code: https://github.com/pepkit/geofetch/


Installation

To install geofetch use this command:

pip install geofetch

or install the latest version from the GitHub repository:

pip install git+https://github.com/pepkit/geofetch.git

How to cite:

https://doi.org/10.1093/bioinformatics/btad069

@article{10.1093/bioinformatics/btad069,
    author = {Khoroshevskyi, Oleksandr and LeRoy, Nathan and Reuter, Vincent P and Sheffield, Nathan C},
    title = "{GEOfetch: a command-line tool for downloading data and standardized metadata from GEO and SRA}",
    journal = {Bioinformatics},
    volume = {39},
    number = {3},
    pages = {btad069},
    year = {2023},
    month = {03},
    abstract = "{The Gene Expression Omnibus has become an important source of biological data for secondary analysis. However, there is no simple, programmatic way to download data and metadata from Gene Expression Omnibus (GEO) in a standardized annotation format.To address this, we present GEOfetch—a command-line tool that downloads and organizes data and metadata from GEO and SRA. GEOfetch formats the downloaded metadata as a Portable Encapsulated Project, providing universal format for the reanalysis of public data.GEOfetch is available on Bioconda and the Python Package Index (PyPI).}",
    issn = {1367-4811},
    doi = {10.1093/bioinformatics/btad069},
    url = {https://doi.org/10.1093/bioinformatics/btad069},
    eprint = {https://academic.oup.com/bioinformatics/article-pdf/39/3/btad069/49407404/btad069.pdf},
}