Closed bjmt closed 3 years ago
Oops. Actually, it would be more accurate to say the error occurs at:
datamat <- read_tsv(fname, quote = "\"", na = c("NA", "null", "NULL", "Null"),
skip = series_table_begin_line,
comment = "!series_matrix_table_end",
skip_empty_rows = FALSE)
Sorry, can you tell me how to solve this problem?
Have the same problem for GSE53258.
Apparently, the problem is an incorrect skip = series_table_begin_line value.
@seandavi this can be fixed by explicitly splitting the text on \n to build dat:
text <- readr::read_file(fname)
dat <- strsplit(text, "\n", fixed = TRUE)[[1]]
However, this breaks parsing of GSE781. It seems the behavior differs between read.table and read_tsv. Maybe switch to the latter completely?
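For anyone trying to reproduce the offset problem, here is a minimal sketch of how a bare \r inside a metadata line can shift a line count. It assumes the offset is counted with a reader such as readLines(), which accepts LF, CRLF, or a lone CR as an end-of-line marker; the toy file contents are made up for illustration and this is not GEOquery's actual code.

```r
# Toy stand-in for a series matrix file (made-up content):
# the second metadata line has a bare \r inside its value.
text <- paste0(
  "!Sample_title\tA\n",
  "!Sample_data_processing\tstep 1\rstep 2\n",
  "!series_matrix_table_begin\n",
  "ID_REF\tGSM1\n"
)
tmp <- tempfile(fileext = ".txt")
writeBin(charToRaw(text), tmp)

# readLines() treats the lone \r as a line break, so it sees one extra line.
lines_readlines <- readLines(tmp)

# Splitting on "\n" only keeps the embedded \r inside its line.
lines_split <- strsplit(text, "\n", fixed = TRUE)[[1]]

begin_readlines <- grep("^!series_matrix_table_begin", lines_readlines)
begin_split <- grep("^!series_matrix_table_begin", lines_split)

# The two positions differ by the number of bare \r characters above the marker.
print(c(readLines = begin_readlines, strsplit = begin_split))
```

If the parser that later consumes the skip value only breaks lines on \n, an offset counted the first way ends up one line too far for every bare \r above the table.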
Investigating further led me to believe the error occurred at this call within GEOquery:::parseGSEMatrix():

I downloaded the GSE27957_series_matrix.txt file from GEO and looked for what might be wrong. What I found was that the !Sample_data_processing entries had carriage return characters (\r) within each entry, for example:

After going through and manually deleting these, getGEO() worked fine.
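The manual cleanup can also be scripted. Below is a rough sketch in base R that removes every \r not followed by \n and writes the result to a new file. strip_bare_cr is a hypothetical helper name, not part of GEOquery, and it works on raw bytes so that it does not depend on the file's encoding.

```r
# Hypothetical helper (not part of GEOquery): remove carriage returns that are
# not part of a CRLF pair, leaving genuine line endings untouched.
strip_bare_cr <- function(path_in, path_out = tempfile(fileext = ".txt")) {
  bytes <- readBin(path_in, what = "raw", n = file.size(path_in))
  is_cr <- bytes == as.raw(0x0d)
  # TRUE where the following byte is a line feed; the last byte has no successor.
  next_is_lf <- c(bytes[-1] == as.raw(0x0a), FALSE)
  writeBin(bytes[!(is_cr & !next_is_lf)], path_out)
  path_out
}

# Small self-check on a made-up file: the bare \r goes, the \r\n pair stays.
demo_in <- tempfile(fileext = ".txt")
writeBin(charToRaw("step 1\rstep 2\r\nnext line\n"), demo_in)
demo_out <- strip_bare_cr(demo_in)
cleaned <- rawToChar(readBin(demo_out, what = "raw", n = file.size(demo_out)))
```

The cleaned copy of a downloaded series matrix file could then be passed to getGEO() through its filename argument. I have only checked the helper on toy input, not on GSE27957 itself.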