Closed Zethson closed 2 weeks ago
Hello, thank you for your question.
This is a new feature of GEO. It seems that it was added to GEO a year ago. Unfortunately, by looking closely into this issue, I couldn't find any metadata about raw counts
, or a link to it, using the provided metadata within the GEO API. As a result, geofetch can't handle this specific GEO feature (GEO2R) 😢.
The power of geofetch is in saving all sample metadata in a nicely combined CSV file, along with downloaded processed or raw files (SRA). The GEO database consists of two main units: Projects (GSE) and Samples (GSM). Both of these units can have files, and geofetch can work in a sample-centric (GSM) way as well as in a project-centric (GSE) way. That's why users should specify what data or metadata they are interested in. In your case, it is project-level files.
Additionally, geofetch has a regex filter available for files, so you can easily download data with specific naming patterns or formats. In your case, the CLI command will look like this:
geofetch -i GSE139940 --processed --data-source series --filter GSE139940_180821_lupus_RNA-seq_results.gene.rpkm.xlsx --geo-folder .
--processed
--data-source series
--filter GSE139940_180821_lupus_RNA-seq_results.gene.rpkm.xlsx
--geo-folder .
Let me know if this command worked for you and if you have any other questions.
Thank you, this did the trick!
Dear developers,
I've been playing around with the queries for a while now and looked at the tutorials including https://pep.databio.org/geofetch/code/processed-data-downloading/ and https://pep.databio.org/geofetch/code/raw-data-downloading/ . However, there's two things that I can't seem to figure out:
GSE139940_180821_lupus_RNA-seq_results.gene.rpkm.xlsx
.Sorry for the user questions.
Thank you very much!