Closed pna059 closed 2 years ago
Thanks for your report; can you try the latest stable version ? We fixed a lot of bugs there... (v 2.0.2)
The version 2.0.2 still does not allow correctSystematicG=TRUE
Error: BiocParallel errors
element index: 1
first error: correctSystematicG not supported yet for CAGEexp objects
but when FALSE, the bam files are read in, but the following functions complain.
A CAGEexp object of 1 listed
experiment with a user-defined name and respective class.
Containing an ExperimentList class object of length 1:
[1] tagCountMatrix: RangedSummarizedExperiment with 6473697 rows and 6 columns
Functionality:
experiments() - obtain the ExperimentList instance
colData() - the primary/phenotype DataFrame
sampleMap() - the sample coordination DataFrame
`$`, `[`, `[[` - extract colData columns, subset, or experiment
*Format() - convert into a long or wide DataFrame
assays() - convert ExperimentList to a SimpleList of matrices
exportClass() - save all data to files
> CTSStagCountSE(ce)
Error in CTSStagCountSE(ce) :
Could not find CTSS tag counts, see ‘?getCTSS’.
> CTSScoordinatesGR(ce)
Error in h(simpleError(msg, call)) :
error in evaluating the argument 'x' in selecting a method for function 'rowRanges': Could not find CTSS tag counts, see ‘?getCTSS’.
sessionInfo()
> sessionInfo()
R version 4.1.1 (2021-08-10)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 10 (buster)
Matrix products: default
BLAS/LAPACK: /cvmfs/software.metacentrum.cz/spack1/software/intel-parallel-studio/linux-debian10-x86_64/cluster.2019.4-intel-iifs5g/compilers_and_libraries_2019.4.243/linux/mkl/lib/intel64_lin/libmkl_intel_lp64.so
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=cs_CZ.UTF-8
[4] LC_COLLATE=en_US.UTF-8 LC_MONETARY=cs_CZ.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=cs_CZ.UTF-8 LC_NAME=C LC_ADDRESS=C
[10] LC_TELEPHONE=C LC_MEASUREMENT=cs_CZ.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats4 stats graphics grDevices utils datasets methods base
other attached packages:
[1] CAGEr_2.0.2 MultiAssayExperiment_1.20.0 SummarizedExperiment_1.24.0
[4] Biobase_2.54.0 GenomicRanges_1.46.1 GenomeInfoDb_1.30.0
[7] IRanges_2.28.0 S4Vectors_0.32.3 BiocGenerics_0.40.0
[10] MatrixGenerics_1.6.0 matrixStats_0.61.0
loaded via a namespace (and not attached):
[1] VGAM_1.1-5 splines_4.1.1 DelayedMatrixStats_1.16.0 gtools_3.9.2
[5] assertthat_0.2.1 BSgenome_1.62.0 GenomeInfoDbData_1.2.7 Rsamtools_2.10.0
[9] yaml_2.2.1 pillar_1.6.4 lattice_0.20-45 glue_1.5.1
[13] XVector_0.34.0 colorspace_2.0-2 Matrix_1.3-4 plyr_1.8.6
[17] XML_3.99-0.8 pkgconfig_2.0.3 zlibbioc_1.40.0 purrr_0.3.4
[21] scales_1.1.1 stringdist_0.9.8 BiocParallel_1.28.2 tibble_3.1.6
[25] mgcv_1.8-38 generics_0.1.1 ggplot2_3.3.5 ellipsis_0.3.2
[29] cachem_1.0.6 formula.tools_1.7.1 magrittr_2.0.1 crayon_1.4.2
[33] memoise_2.0.1 fansi_0.5.0 operator.tools_1.6.3 nlme_3.1-153
[37] MASS_7.3-54 vegan_2.5-7 tools_4.1.1 data.table_1.14.2
[41] BiocIO_1.4.0 lifecycle_1.0.1 stringr_1.4.0 munsell_0.5.0
[45] cluster_2.1.2 DelayedArray_0.20.0 Biostrings_2.62.0 som_0.3-5.1
[49] compiler_4.1.1 rlang_0.4.12 grid_4.1.1 RCurl_1.98-1.5
[53] rstudioapi_0.13 rjson_0.2.20 bitops_1.0-7 restfulr_0.0.13
[57] gtable_0.3.0 DBI_1.1.1 reshape2_1.4.4 R6_2.5.1
[61] GenomicAlignments_1.30.0 dplyr_1.0.7 rtracklayer_1.54.0 fastmap_1.1.0
[65] utf8_1.2.2 KernSmooth_2.23-20 permute_0.9-5 stringi_1.7.6
[69] parallel_4.1.1 Rcpp_1.0.7 vctrs_0.3.8 tidyselect_1.1.1
[73] sparseMatrixStats_1.6.0
Version 2.1.2 is from the devel
branch; I tested it when I uploaded, but it can break because of changes in the API of other packages. The release
version (2.0.2) is stable and intended for broad use. Can you try with that one? If it also fails, can you prepare a reproducible example?
Sorry, my mistake in the last message. I updated the comment to correct the version I am currently using and added a complete session Info.
I think there might be a problem with the CAGEexp object
> str(ce)
Formal class 'CAGEexp' [package "CAGEr"] with 5 slots
..@ ExperimentList:Formal class 'ExperimentList' [package "MultiAssayExperiment"] with 4 slots
.. .. ..@ listData : list()
.. .. ..@ elementType : chr "ANY"
.. .. ..@ elementMetadata: NULL
.. .. ..@ metadata : list()
..@ colData :Formal class 'DFrame' [package "S4Vectors"] with 6 slots
.. .. ..@ rownames : chr [1:6] "D4.A" "D4.B" "D24.A" "D24.B" ...
.. .. ..@ nrows : int 6
.. .. ..@ listData :List of 3
.. .. .. ..$ inputFiles : chr [1:6] "Result/BWA_HISAT2_mapped/IMorexV3//DAG4/D4.A.bam" "Result/BWA_HISAT2_mapped/IMorexV3//DAG4/D4.B.bam" "Result/BWA_HISAT2_mapped/IMorexV3//DAP24/D24.A.bam" "Result/BWA_HISAT2_mapped/IMorexV3//DAP24/D24.B.bam" ...
.. .. .. ..$ inputFilesType: chr [1:6] "bam" "bam" "bam" "bam" ...
.. .. .. ..$ sampleLabels : chr [1:6] "D4.A" "D4.B" "D24.A" "D24.B" ...
.. .. ..@ elementType : chr "ANY"
.. .. ..@ elementMetadata: NULL
.. .. ..@ metadata : list()
..@ sampleMap :Formal class 'DataFrame' [package "S4Vectors"] with 6 slots
.. .. ..@ rownames : NULL
.. .. ..@ nrows : int 0
.. .. ..@ listData : Named list()
.. .. ..@ elementType : chr "ANY"
.. .. ..@ elementMetadata: NULL
.. .. ..@ metadata : list()
..@ drops : list()
..@ metadata :List of 1
.. ..$ genomeName: chr "BSgenome.MorexV3.Gatersleben"
What would be the reproducible example? Our BSgenome and a subset of one of the BAM files?
Yes, the best for me is one or two BAM files subsetted so that they contain only a few megabytes of data, plus your BSgenome
package if it is needed to reproduce the bug (that is: if the bug happen only if genomeName
is not NULL
), plus the R commands to trigger the bug.
By the way, please note that in CAGEr 2.0 the output of a command needs to be explicitly assigned, like in: ce <- getCTSS(ce)
.
Thank you for the "BTW"! Not explicitly assigning the output was my fault. I can move forward in the pipeline now :)
Excellent; you can close this issue if the original problem is fixed. (Feel free to open new issues for new problems)
Hi, I am running CAGEr using a (large) custom genome. Starting with the bam files, I am getting BiocParallel error already with the getCTSSes function. Could you help me to identify the problem? THANKS!