Closed ghost closed 6 years ago
As this is a question instead of an issue, could you please post it at the Bioconductor support forum: https://support.bioconductor.org. In this way, more users will benefit!
You have to specify some parameters to your specific data. For example, the 'Reverse' column is completely empty so filtering on this column doesn't make sense. Additionally, there is no 'Gene.names' column, so you should specify another column in the _makeunique function. See code below to obtain a SummarizedExperiment object.
An additional warning: your data only consists of 98 quantified proteins and most of these proteins are only quantified in a single sample. After filtering out proteins with too many missing values, the dataset only consists of tens of proteins. This is too few to perform reliable normalization and FDR calculations.
`
proteins <- read.csv("proteinGroups.txt", header = TRUE, sep = "\t", stringsAsFactors = FALSE) expdesign <- read.csv("ExpDesign.txt", header = TRUE, sep = "\t", stringsAsFactors = FALSE)
proteins <- filter(proteins, Potential.contaminant != "+") proteins_unique <- make_unique(proteins, "Majority.protein.IDs", "Protein.IDs", delim = ";")
columns <- grep("LFQ.", colnames(proteins_unique)) se <- make_se(proteins_unique, columns, expdesign) `
ExpDesign.txt
This is a question and not necessarily an issue.I have encountered some problems based on my lack of understanding whilst using your package.I have zealously followed the instructions in the vignette and it seems as if some of the functions are set to expect specific column names with inflexibility. I have an output file from a LFQ MaxQuant analysis and some of the column names you mention do not exist and this has disallowed me from creating a summarized experiment object.
My question is, how can I create a summarized experiment object to allow me to continue with my analysis ?
Below is a documentation of some errors that I have encountered.
Matrix products: default
locale: [1] LC_COLLATE=German_Germany.1252 LC_CTYPE=German_Germany.1252 LC_MONETARY=German_Germany.1252 LC_NUMERIC=C
[5] LC_TIME=German_Germany.1252
attached base packages: [1] stats4 parallel stats graphics grDevices utils datasets methods base
other attached packages: [1] DEP_1.0.1 bindrcpp_0.2 readr_1.1.1 dplyr_0.7.4
[5] rpart.plot_2.1.2 rpart_4.1-11 SummarizedExperiment_1.8.0 DelayedArray_0.4.1
[9] matrixStats_0.52.2 Biobase_2.38.0 GenomicRanges_1.30.0 GenomeInfoDb_1.14.0
[13] IRanges_2.12.0 S4Vectors_0.16.0 BiocGenerics_0.24.0
loaded via a namespace (and not attached): [1] nlme_3.1-131 ProtGenerics_1.10.0 bitops_1.0-6 doParallel_1.0.11 RColorBrewer_1.1-2
[6] MSnbase_2.4.0 tools_3.4.1 R6_2.2.2 DT_0.2 affyio_1.48.0
[11] tmvtnorm_1.4-10 lazyeval_0.2.1 colorspace_1.3-2 GetoptLong_0.1.6 mnormt_1.5-5
[16] compiler_3.4.1 MassSpecWavelet_1.44.0 preprocessCore_1.40.0 sandwich_2.4-0 scales_0.5.0
[21] mvtnorm_1.0-6 psych_1.7.8 affy_1.56.0 stringr_1.2.0 digest_0.6.12
[26] foreign_0.8-69 XVector_0.18.0 pkgconfig_2.0.1 htmltools_0.3.6 limma_3.34.2
[31] htmlwidgets_0.9 rlang_0.1.4 GlobalOptions_0.0.12 impute_1.52.0 shiny_1.0.5
[36] BiocInstaller_1.28.0 shape_1.4.3 bindr_0.1 zoo_1.8-0 mzID_1.16.0
[41] BiocParallel_1.12.0 RCurl_1.95-4.8 magrittr_1.5 GenomeInfoDbData_0.99.1 MALDIquant_1.17
[46] Matrix_1.2-10 Rcpp_0.12.13 munsell_0.4.3 imputeLCMD_2.0 vsn_3.46.0
[51] stringi_1.1.6 MASS_7.3-47 zlibbioc_1.24.0 plyr_1.8.4 grid_3.4.1
[56] shinydashboard_0.6.1 lattice_0.20-35 splines_3.4.1 multtest_2.34.0 circlize_0.4.2
[61] hms_0.4.0 mzR_2.12.0 xcms_3.0.0 ComplexHeatmap_1.17.1 rjson_0.2.15
[66] reshape2_1.4.2 codetools_0.2-15 XML_3.98-1.9 glue_1.2.0 pcaMethods_1.70.0
[71] data.table_1.10.4-3 httpuv_1.3.5 foreach_1.4.3 gtable_0.2.0 RANN_2.5.1
[76] purrr_0.2.4 tidyr_0.7.2 norm_1.0-9.5 assertthat_0.2.0 ggplot2_2.2.1
[81] mime_0.5 xtable_1.8-2 broom_0.4.3 survival_2.41-3 tibble_1.3.4
[86] iterators_1.0.8 gmm_1.6-1
proteinGroups.txt