Closed e-kotov closed 3 months ago
Hi @e-kotov, thanks for the pre-submission. We are discussing with the editors. Thanks, Julia
Dear @e-kotov, Thank you very much for your submission. Your package {rJavaEnv} looks very useful and helpful for many use cases, however we believe for us it is out-of-scope for the workflow category. The category is for “tools that automate and link together workflows, such as build systems and tools to manage continuous integration” and your package is even more general than the packages that we aim to support here. Many thanks again for your pre-submission and I am looking forward to testing out your package in the future.
Many thanks, Julia
Dear @jooolia , thank you and the editors for taking the time to review {rJavaEnv}
.
Submitting Author Name: Egor Kotov Submitting Author Github Handle: !--author1-->@e-kotov<!--end-author1-- Repository: https://github.com/e-kotov/rJavaEnv Submission type: Pre-submission Language: en
Scope
Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check one or more appropriate boxes below):
Data Lifecycle Packages
[ ] data retrieval
[ ] data extraction
[ ] data munging
[ ] data deposition
[ ] data validation and testing
[x] workflow automation
[ ] version control
[ ] citation management and bibliometrics
[ ] scientific software wrappers
[ ] field and lab reproducibility tools
[ ] database software bindings
[ ] geospatial data
[ ] text analysis
Statistical Packages
[ ] Bayesian and Monte Carlo Routines
[ ] Dimensionality Reduction, Clustering, and Unsupervised Learning
[ ] Machine Learning
[ ] Regression and Supervised Learning
[ ] Exploratory Data Analysis (EDA) and Summary Statistics
[ ] Spatial Analyses
[ ] Time Series Analyses
[ ] Probability Distributions
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
Java
/rJava
-dependent R packages, which is essential for maintaining reproducibility and efficiency in scientific workflows. It may also be used in tandem with other workflow automation packages such as{targets}
. E.g. one can use R packages that require different Java versions in the same project, because{targets}
runs all steps in a separate R processes. Hence addingrJavaEnv::java_env_set("java_path", "session")
(see docs) in the beginning of a particular script invoked by{targets}
will use a different Java version without conflicting with any other step of the workflow.If submitting a statistical package, have you already incorporated documentation of standards into your code via the srr package?
Who is the target audience and what are scientific applications of this package?
Java
/rJava
-dependent R packages on CRAN and Bioconductor (you can find my brief overview here) and very often users face issues with getting the right Java version installed. Popular packages include {r5r
},{opentripplanner}
,tabulapdf}
, multiple text analysis packages such as {openNLP}, {mallet}, and others.Java
/rJava
-dependent R packages, who can use{rJavaEnv}
for Java set-up instructions to use their packages. Some have already tested the pre-release version of{rJavaEnv}
in a workshop with 35-40 people and were happy with the results, others have even contributed to{rJavaEnv}
.Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?
sdkman
, however these tools do not work in the same way on all platforms, e.g.sdkman
can only work on Windows with WSL or MSYS+MinGW, that is, it will not work out of the box for most users without additional complicated setup which often requires admin privileges. What is more, these general purpose tools set Java version globally. The approach of{rJavaEnv}
to providing a safe per-project installation of Java is currently detailed in this comment in the package Issues (to be converted into a separate vignette).{rJavaEnv}
works on all three major platforms and does not require admin privileges. Also, it does not pollute the user's system with system-wide Java installs and always only impacts the current project/working directory and only for R projects.(If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research?
Any other questions or issues we should be aware of?:
{rJava}
package from source and therefore require some Java to be present for that. I am in the process of addressing these issues before the full submission to rOpenSci.