:exclamation: This is a read-only mirror of the CRAN R package repository. MetaIntegrator — Meta-Analysis of Gene Expression Data. Homepage: http://biorxiv.org/content/early/2016/08/25/071514
6
stars
1
forks
source link
problem in processing the data for Metaintegrator #1
I have two problems with the package
First
I used getGEOData for importing and preprocessing the data, however, when I do the meta-analysis, I get this error
Error in runMetaAnalysis(exampleMetaObj, maxCores = 1) :
In GSE5388 there are not @ least two cases and/or two controlsIn GSE46449 there are not @ least two cases and/or two controls
Here is the full code
> yi<-getGEOData(gseVector = "GSE5388",formattedNames = "GSE5388",qNorm = T)
Found 1 file(s)
GSE5388_series_matrix.txt.gz
Using locally cached version: C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GSE5388_series_matrix.txt.gz
Parsed with column specification:
cols(
.default = col_double(),
ID_REF = col_character()
)
See spec(...) for full column specifications.
Using locally cached version of GPL96 found here:
C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GPL96.soft
Warning: 68 parsing failures.
row col expected actual file
22216 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
22217 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
22218 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
22219 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
22220 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
..... ....... .................. ......... ............
See problems(...) for more details.
Parsed with column specification:
cols(
.default = col_double(),
ID_REF = col_character()
)
See spec(...) for full column specifications.
Using locally cached version of GPL570 found here:
C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GPL570.soft
Warning: 62 parsing failures.
row col expected actual file
54614 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
54615 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
54616 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
54617 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
54618 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data
..... ....... .................. ......... ............
See problems(...) for more details.
> gse46449<-y$originalData$GSE46449> gse46449<-cleanUpPheno(gse46449)> discovery_datasets <- list(gse46449, gse5388)> names(discovery_datasets) = c(gse5388$formattedName, gse46449$formattedName)> exampleMetaObj=list()> exampleMetaObj$originalData <- discovery_datasets> runMetaAnalysis(exampleMetaObj)
Error in runMetaAnalysis(exampleMetaObj) :
In GSE5388 there are not @ least two cases and/or two controlsIn GSE46449 there are not @ least two cases and/or two controls
SECOND Problem
I have another problem regarding the preparation of data manually. I created csv file for each element of class, key, pheno and expr then converted them to matrix. However, when I check the format it gives me this error message (I have attached the files). How to convert it to matrix correctly?
checkDataObject(datasetObject,"Dataset")
Warning messages:
1: In .datasetCheckNull(dataset) :
datasetObject$formattedName is null. The formatted name is missing
2: In .datasetCheckRowColNames(dataset) :
The probes listed in expr and the probes listed in keys do not match.
3: In .datasetCheckRowColNames(dataset) : Class vector is not named.
4: In .checkClass(dataset) : Class vector is not a numeric vector.
5: In .datasetCheckType(dataset) : pheno must be a data frame.
6: In .datasetCheckType(dataset) : expr must be a numeric matrix.
7: In .datasetCheckType(dataset) :
class must be either: 1) a numeric vector of 1's and 0's OR 2) NULL
8: In .datasetCheckType(dataset) : keys must be a vector.
I have two problems with the package First I used getGEOData for importing and preprocessing the data, however, when I do the meta-analysis, I get this error
Error in runMetaAnalysis(exampleMetaObj, maxCores = 1) : In GSE5388 there are not @ least two cases and/or two controlsIn GSE46449 there are not @ least two cases and/or two controls
Here is the full code
> yi<-getGEOData(gseVector = "GSE5388",formattedNames = "GSE5388",qNorm = T)
Found 1 file(s) GSE5388_series_matrix.txt.gz Using locally cached version: C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GSE5388_series_matrix.txt.gz Parsed with column specification: cols( .default = col_double(), ID_REF = col_character() ) See spec(...) for full column specifications. Using locally cached version of GPL96 found here: C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GPL96.soft Warning: 68 parsing failures. row col expected actual file 22216 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 22217 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 22218 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 22219 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 22220 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data ..... ....... .................. ......... ............ See problems(...) for more details.
`> gse5388<-yi$originalData$GSE5388
Parsed with column specification: cols( .default = col_double(), ID_REF = col_character() ) See spec(...) for full column specifications. Using locally cached version of GPL570 found here: C:\Users\Sara\AppData\Local\Temp\RtmpcZNw3R/GPL570.soft Warning: 62 parsing failures. row col expected actual file 54614 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 54615 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 54616 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 54617 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data 54618 SPOT_ID 1/0/T/F/TRUE/FALSE --Control literal data ..... ....... .................. ......... ............ See problems(...) for more details.
> gse46449<-y$originalData$GSE46449
> gse46449<-cleanUpPheno(gse46449)
> discovery_datasets <- list(gse46449, gse5388)
> names(discovery_datasets) = c(gse5388$formattedName, gse46449$formattedName)
> exampleMetaObj=list()
> exampleMetaObj$originalData <- discovery_datasets
> runMetaAnalysis(exampleMetaObj)
Error in runMetaAnalysis(exampleMetaObj) : In GSE5388 there are not @ least two cases and/or two controlsIn GSE46449 there are not @ least two cases and/or two controlsSECOND Problem I have another problem regarding the preparation of data manually. I created csv file for each element of class, key, pheno and expr then converted them to matrix. However, when I check the format it gives me this error message (I have attached the files). How to convert it to matrix correctly?
> View(expr)
> expr<-as.matrix(expr)
> pheno<-as.matrix(pheno)
>formattedName = "bipo1"
>keys<-as.matrix(keys)
>class<-as.vector(class)
> datasetObject<list(class=class,expr=expr,keys=keys,pheno=pheno,formattedName=formattedName)
What should I do?