Closed ning-y closed 3 years ago
Thanks, @ning-y, for the report. I'll fix the error, but it won't help you much. The processed data were not submitted to GEO, so you'll need to grab the supplemental files and then hand-import them.
@seandavi Thanks for the quick reply! The confirmation is good enough for me
For others coming across this issue, it might be sufficient to use reutils
if you do not need the full range of information that comes with GEOquery
. For example, I was just trying to get a vector of GSMs for a GSE. That can be accomplished with
esearch("GSE35126", db="gds") %>%
uid() %>% esummary(db="gds") %>% content("xml") %>%
getNodeSet("//DocumentSummary[./Accession='GSE35126']") %>% first() %>%
getNodeSet("//Sample/Accession") %>% xmlValue()
I am interested in getting the title column and some associated characteristics columns. I tried getGEO("GSE139204", GSEMatrix=TRUE, destdir = paste0(current_dir, "/work"))
for GSE139204, but it does not work today.
However, I can see the series_matrix file is available online. Do you have suggestions on how to parse the downloaded series_matrix file? Please advise what to do. Thanks.
Hi @chiwwong, I do not know off the top of my head what information reutils::esearch
returns, but it may be worth looking through its XML to see if the information you want is available there.
library(reutils)
library(magrittr) # for the pipes (%>%)
esearch("GSE139204", db="gds") %>%
uid() %>% esummary(db="gds") %>% content("xml")
If it's available, you should be able to write the xpath query to extract them. Note that reutils
is a different R/Bioconductor package than GEOquery
. GEOquery
is great for somethings, but for metadata reutils
is usually sufficient, in my opinion.
This is fixed in b81fe0aaa63cd0667858b243d85bed2577302da3.
To replicate:
Full error message:
Session info: