Closed YinanZheng closed 7 years ago
Hi @YinanZheng
Thanks for submitting your package. We are taking a quick look at it and you will hear back from us soon.
The DESCRIPTION file for this package is:
Package: REMP
Type: Package
Title: Repetitive Element Methylation Prediction
Version: 0.99.0
Authors@R: c(person("Yinan","Zheng", role = c("aut","cre"), email = "y-zheng@northwestern.edu"),person("Lifang","Hou", role = c("cph"), email = "l-hou@northwestern.edu"))
Author: Yinan Zheng <y-zheng@northwestern.edu> [aut, cre], Lifang Hou <l-hou@northwestern.edu> [cph]
Maintainer: Yinan Zheng <y-zheng@northwestern.edu>
Description: Predicting locus-specific repetitive element methylation based on Illumina 450K/EPIC Methylation BeadChip Array.
License: file LICENSE
Depends:
R (>= 3.3.0),
data.table,
doParallel,
caret
Imports:
AnnotationHub,
GenomicRanges,
IRanges,
minfi,
BSgenome,
BSgenome.Hsapiens.UCSC.hg19,
IlluminaHumanMethylation450kanno.ilmn12.hg19,
IlluminaHumanMethylationEPICanno.ilm10b2.hg19,
impute,
stringi,
randomForest,
kernlab,
Suggests:
REMPdata,
knitr,
rmarkdown
VignetteBuilder: knitr
URL: https://github.com/YinanZheng/REMP
BugReports: https://github.com/YinanZheng/REMP/issues
LazyData: true
SystemRequirements: GNU make
biocViews: DNAMethylation, Microarray, MethylationArray, GenePrediction
RoxygenNote: 5.0.1
Your package has been approved for building. Your package is now submitted to our queue.
IMPORTANT: Please read the instructions for setting up a push hook on your repository, or further changes to your repository will NOT trigger a new build.
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
Congratulations! The package built without errors or warnings on all platforms.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20160920055121.html
message()
instead of cat()
.Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20160920121700.html
Hi lawremi, Thanks for the review. In the next version:
Hi @YinanZheng,
Starting build on additional package https://github.com/YinanZheng/REMPdata.
IMPORTANT: Please read the instructions for setting up a push hook on your repository, or further changes to your additional package repository will NOT trigger a new build.
The DESCRIPTION file of this additional package is:
Package: REMPdata
Version: 0.99.0
Title: Example data for REMP package
Authors@R: c(person("Yinan","Zheng", role = c("aut","cre"), email = "y-zheng@northwestern.edu"), person("Lifang","Hou", role = c("cph"), email = "l-hou@northwestern.edu"))
Author: Yinan Zheng <y-zheng@northwestern.edu> [aut, cre], Lifang Hou <l-hou@northwestern.edu> [cph]
Maintainer: Yinan Zheng <y-zheng@northwestern.edu>
Description: This package contains HapMap LCL GM12878 methylation data profiled by Illumina 450K array and EPIC Methylation BeadChip Array.
Depends: R (>= 3.3.0), S4Vectors
License: file LICENSE
LazyData: true
biocViews: Homo_sapiens_Data, MethylationArrayData, HapMap, ENCODE
NeedsCompilation: no
RoxygenNote: 5.0.1
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMPdata_buildreport_20160920164218.html
Thank you for your contribution.
DESCRIPTION
vignette
biocLite("REMP")
will install CRAN, Bioconductor dependencies so
no need to do so explicitly.eval=FALSE
and psuedo-output e.g., at REMP.Rnw:94 --
these quickly become out-of-date. Run the actual code, while staying
within the bounds of Bioconductor space and time guidelines.R
?remp
left a difficult to decipher trail of
files and folders in my working directory. Adopt the standard R
paradigm of returning object to the user, and using objects rather
than file paths as arguments to functions.remp()
, return
the objects that are currently written to a file.GENE.COVERAGE
from remp()
return
a data.frame / DataFrame. Use metadata()
on S4Vectors
objects
like DataFrame()
or GRanges()
to record information about
fitting (? REMP.PREDICT.IMP
; avoid duplicating row names as
columns here).remp()
.tempdir()
as working directory; expose as
default argument workingDir = tempdir()
at line 30 rather than
testing in body of code.eval(parse(text(...
is seldom needed and very
error prone; adop usual programming pardigms instead.rowMeans()
rather than apply()
for
efficient calcualtion.1:n
formulations, using seq_along()
or seq_len()
instead, to avoid surprises such as1:0
.e.g., remp.R:100 improve code readability by avoiding complicated nested expressions
fneed <- c(paste0(REtype, "_Profiled.rda"), "ILMN_refGene_hg19.rda")
favail <- dir(file.path(workingDir, arrayType))
if (!all(fneed %in% favail)) {
c("foo")
is simply "foo"
.save()
/ load()
, use
saveRDS()
/ readRDS()
to facilitate assignment of data to
variables.stop(paste0())
can usually be written as stop()
verbose=FALSE
argument) and focused on
minimal progress / warning / error so that users are informed of
important rather than all events.suppressWarnings()
in code (e.g., utilities.R:180) makes
one nervous -- the programmer knows that something is going to go
poorly, but is not programming defensively to avoid the situation.man
Do you plan to revise your package in time for the current release? The deadline is today.
I am still working on the revision. Thanks for the notice though.
I have revised the parallel part using BiocParallel package. However, I came across a minor issue: when I use bplapply, the package loading message always show up multiple times (times equal to the number of the backend workers). I understand that each worker needs these packages as the apply function needs them:
Loading required package: Biostrings Loading required package: BiocGenerics Loading required package: parallel
Attaching package: 'BiocGenerics'
The following objects are masked from 'package:parallel':
clusterApply, clusterApplyLB, clusterCall, clusterEvalQ, clusterExport, clusterMap, parApply, parCapply, parLapply, parLapplyLB, parRapply, parSapply, parSapplyLB
The following objects are masked from 'package:stats':
IQR, mad, xtabs
The following objects are masked from 'package:base':
anyDuplicated, append, as.data.frame, cbind, colnames, do.call, duplicated, eval, evalq, Filter, Find, get, grep, grepl, intersect, is.unsorted, lapply, lengths, Map, mapply, match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, Position, rank, rbind, Reduce, rownames, sapply, setdiff, sort, table, tapply, union, unique, unsplit
But these loading messages are unnecessary and flush away my function's diagnostic messages. Is there a way to turn off these messages?
Can you provide an example of code that I can run and that illustrates the problem?
The task is to identify "CG" sequence throughout a very large DNAStringSet object. Here is a simplified example code:
library(BiocParallel)
library(Biostrings)
library(BSgenome)
# SEQ.RE is the huge DNAStringSet object.
SEQ.RE<-getSeq(BSgenome.Hsapiens.UCSC.hg19::Hsapiens,
names = rep("chr1",4),
start = c(16777161,25165801,150994893,167772065),
end = c(16777470,25166089,150995191,167772362),
strand = c("+","-","-","+"))
# Define the apply function
.RECpGPos <- function(seq, CpG)
{
start(matchPattern(CpG, seq))
}
bp = bpparam()
bpstart(bp)
RE.CpG <- bplapply(SEQ.RE, .RECpGPos, BPPARAM = bp, CpG = DNAString("CG"))
bpstop(bp)
sessionInfo() R version 3.3.1 (2016-06-21) Platform: x86_64-w64-mingw32/x64 (64-bit) Running under: Windows >= 8 x64 (build 9200)
locale: [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252 LC_MONETARY=English_United States.1252 LC_NUMERIC=C LC_TIME=English_United States.1252
attached base packages: [1] stats4 parallel stats graphics grDevices utils datasets methods base
other attached packages: [1] BSgenome_1.40.1 rtracklayer_1.32.2 GenomicRanges_1.24.3 GenomeInfoDb_1.8.7 Biostrings_2.40.2 XVector_0.12.1 IRanges_2.6.1 S4Vectors_0.10.3 BiocGenerics_0.18.0 [10] BiocParallel_1.6.6
loaded via a namespace (and not attached): [1] XML_3.98-1.4 snow_0.4-1 BSgenome.Hsapiens.UCSC.hg19_1.4.0 Rsamtools_1.24.0 bitops_1.0-6
[6] GenomicAlignments_1.8.4 zlibbioc_1.18.0 tools_3.3.1 Biobase_2.32.0 RCurl_1.95-4.8
[11] SummarizedExperiment_1.2.3
The package loading message appeared 6 times in my PC.
Thanks for the help.
The startup message comes when the serialized instance is sent to the workers; it's hard to avoid this other than by pre-loading the packages on the workers, e.g.,
bp = SnowParam(2)
bpstart(bp)
bplapply(seq_len(bpnworkers(bp)), function(i) {
suppressPackageStartupMessages({
library(Biostrings)
})
}, BPPARAM=bp)
RE.CpG <- bplapply(SEQ.RE, .RECpGPos, BPPARAM = bp, CpG = DNAString("CG"))
bpstop(bp)
Sending large amounts of data back and forth to workers is not advised (e.g., sending the ranges for getSeq()
, and getting the large sequence data on the worker rather than manager), and it's often MUCH better to use vectorized functions when they exist, rather than iteration like lapply()
. Also, when a vectorized version of the function exists, it's better to use bpvec()
rather than bplapply()
; bpvec()
splits the data into chunks that are vectorized, rather than sending individual elements. So
.vRECpGPos <- function(seq, CpG)
{
start(vmatchPattern(CpG, seq))
}
.vRECpGPos(SEQ.RE, CpG=DNAString("CG"))
and only if the above is slow
bpvec(SEQ.RE, .vRECpGPos, CpG=DNAString("CG"))
Dear Martin, Thank you so much! The vectorized function works like a charm. But I still need to use bpvec to boost the speed (.vRECpGPos took ~40s). bpvec is about 35% faster than bplapply.
We only start builds when the Version
field in the DESCRIPTION
file is incremented. For example, by changing
Version: 0.99.0
to
Version 0.99.1
If you did not intend to start a build, you don't need to
do anything. If you did want to start a build, increment
the Version:
field and try again.
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, WARNINGS, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161113164731.html
Received a valid push; starting a build. Commits are:
7fa7616 0.99.2
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161113202441.html
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161113210450.html
Received a valid push; starting a build. Commits are:
5e686d2 0.99.4
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161114010639.html
Received a valid push; starting a build. Commits are:
f7db1a8 0.99.5
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161115063114.html
Hi,
I have revised my package as commented above . I had no build error and passed all checks in my machine but it said build error here.
I need to use some annotation or data packages like IlluminaHumanMethylation450kanno.ilmn12.hg19 in my package. So I used requireNamespace and :: in my code, as recommended. It works smoothly in my machine but I don't quite understand what the error message means and why the server cannot find the package. I have tried multiple ways but no luck. Could you please advise?
Also in my previous request I submitted a linked data package. But now it is no longer necessary thanks to AnnotationHub. It can be removed.
By the way, In my previous attempts, couple of packages used in my package was built on 3.3.2 but in the server they could be still in 3.3.1, which could lead to check warnings. Is there anything I can do to avoid this kind of warning?
Thanks!
I think your approach is correct but that minfi requires a bug fix; I suggest that you move the annotation package to the Depends: field of the DESCRIPTION file until the minfi package is fixed. Please do not worry about the use of 3.3.1 / 3.3.2, the single package builder is in the process of being updated.
Received a valid push; starting a build. Commits are:
4f95c87 0.99.6
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "ABNORMAL". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161115224608.html
The "abnormal" is an error on our end from updating the single package builder. Once the update is complete I will manually kick off a build of your package
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161116093213.html
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "ABNORMAL". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMPdata_buildreport_20161116093416.html
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161116095229.html
Received a valid push; starting a build. Commits are:
af4726b 0.99.7
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161120054730.html
Hi @mtmorgan ,
I don't quite understand how to fix current errors (http://bioconductor.org/spb_reports/REMP_buildreport_20161120054730.html). Could you please take a look at them? Before this submission, I have my code passed R CMD check with 0 error 0 warning and 2 notes in my PC, Mac, and Linux server.
Thanks.
It seems there are some issues when AnnotationHub is trying to create a local cache in the build server ('Permission denied').
Could you please help with the issues? Thanks a lot.
We tried installing some system dependency on our end. I will manually kick off a new build to see if it helps any of the ERRORS.
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS, skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161128113518.html
@lshep Thank you very much!
But the permission denied issue still exists in Windows:
Warning in dir.create(cache, recursive = TRUE) : cannot create dir 'C:\Users\Default\Documents\AppData', reason 'Permission denied' Warning in dir.create(dirname(cachepath), recursive = TRUE) : cannot create dir 'C:\Users\Default\Documents\AppData', reason 'Permission denied' Warning in file.create(to[okay]) : cannot create file 'C:/Users/Default/Documents/AppData/.AnnotationHub/annotationhub.sqlite3', reason 'No such file or directory'
I think "dir.create(cache, recursive = TRUE)" is from AnnotationHub. System denied to create cache to store AnnotationHub database and caused the downstream directory not found error.
For Linux and Mac:
Can I ignore the warning:
For Mac:
caught segfault address 0x7fef50b879e0, cause 'memory not mapped'
It think similar issue happened here when running AnnotationHub.
Anyone can help with this issue? Thank you very much.
Received a valid push; starting a build. Commits are:
f38bc0e 0.99.8
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS, skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161206142229.html
Received a valid push; starting a build. Commits are:
17e524a 0.99.9
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "WARNINGS, skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161206165058.html
Received a valid push; starting a build. Commits are:
5866546 0.99.10
Dear Package contributor,
This is the automated single package builder at bioconductor.org.
Your package has been built on Linux, Mac, and Windows.
On one or more platforms, the build results were: "skipped, ERROR". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.
Please see the following build report for more details:
http://bioconductor.org/spb_reports/REMP_buildreport_20161206194656.html
We only start builds when the Version
field in the DESCRIPTION
file is incremented. For example, by changing
Version: 0.99.0
to
Version 0.99.1
If you did not intend to start a build, you don't need to
do anything. If you did want to start a build, increment
the Version:
field and try again.
Update the following URL to point to the GitHub repository of the package you wish to submit to Bioconductor
Confirm the following by editing each check box to '[x]'
I am familiar with the essential aspects of Bioconductor software management, including:
For help with submitting your package, please subscribe and post questions to the bioc-devel mailing list.