Closed HenrikBengtsson closed 9 years ago
From more troubleshooting, I'm pretty sure 'GSM1199232_AB227-HuGene-1_0-st-v1-01-1A6.CEL' is a corrupt CEL file. More specifically, it is a truncated file. After having read the "intensities", the following data section is "stddevs" which is only partly read before it reaches the end of the file - (it reads 194,108 bytes out of 4,410,000 wanted). This is also indicated when using the following alternative to read the file:
> library("affxparser")
> pathname <- "GSM1199232_AB227-HuGene-1_0-st-v1-01-1_A6_.CEL"
> hdr <- readCelHeader(pathname)
> nbrOfCells <- hdr$cols * hdr$rows
> nbrOfBytes <- 4 * nbrOfCells # stored as float:s
> nbrOfBytes
[1] 4410000
> data <- readCcg(pathname)
Error in dim(raw) <- c(bytesPerRow, nbrOfRows) :
dims [product 4410000] do not match the length of object [194108]
More evidence. This sample is part of data set GSE49439. Comparing the file size of the CEL.gz files for a few of the samples in that set, this file stands out, e.g.
So, 99.99999% sure it's a corrupt CEL file.
This works:
but the following causes R to crash:
The value of
readHeader
,readXY
andreadIntensities
makes no difference.On Linux, one gets:
On Windows 64-bit, one gets:
Appendix
Arguments of
readCel()
:This CEL file can be downloaded as: