Closed bcjaeger closed 2 years ago
Hi, @jooolia. Thanks for helping me get this pre-submission started. Is there anything I can do to help with the pre-submission tasks?
Hi @bcjaeger, Thanks for your patience. It seems that the machine learning category would be a good fit for your package and the package appears in good shape to make a full submission when you would like.
Regarding the categories, @mpadge may have a bit more to add about this.
Thanks, Julia
Thank you!
Thanks for your submission @bcjaeger Our statistical standards are a work-in-progress. Please help to improve them by providing feedback, particularly on appropriateness or otherwise of any particular standard. That can be done informally via GitHub discussions, or more formally via a pull request to that same repo (standards are here). We're also very keen to develop policies on handling cases of potentially ambiguous categories, such as yours. To help that process, I've started this discussion thread - please offer any insight you can. Thanks!
Great thanks @mpadge.
I will close this issue and we look forward to the full submission. Thanks, Julia
Submitting Author Name: Byron C. Jaeger Submitting Author Github Handle: !--author1-->@bcjaeger<!--end-author1-- Other Package Authors Github handles: (comma separated, delete if none) @nmpieyeskey, @sawyerWeld Repository: https://github.com/bcjaeger/aorsf Submission type: Pre-submission Language: en
Scope
Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check an appropriate box below):
Data Lifecycle Packages
[ ] data retrieval
[ ] data extraction
[ ] database access
[ ] data munging
[ ] data deposition
[ ] workflow automation
[ ] version control
[ ] citation management and bibliometrics
[ ] scientific software wrappers
[ ] database software bindings
[ ] geospatial data
[ ] text data
Statistical Packages
[ ] Bayesian and Monte Carlo Routines
[ ] Dimensionality Reduction, Clustering, and Unsupervised Learning
[X] Machine Learning
[ ] Regression and Supervised Learning
[ ] Exploratory Data Analysis (EDA) and Summary Statistics
[ ] Spatial Analyses
[ ] Time Series Analyses
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
Random forests are a machine learning algorithm and this package provides optimized code to fit a specific type of random forest. I am unsure about whether this belongs in the machine learning category or the regression and supervised learning category. I am uncertain about whether
aorsf
belongs in regression and supervised learning because random forests are definitely used for supervised learning but they don't really fit into a 'regression' framework.Yes
Target audience: people who want to develop or interpret a risk prediction model, i.e., a prediction model for right-censored time-to-event outcomes.
Applications: fit an oblique random survival forest, compute predicted risk at a given time, estimate the importance of individual variables, and compute partial dependence to depict relationships between specific predictors and predicted risk.
The
obliqueRSF
package precedesaorsf
. Theaorsf
package runs hundreds of times faster thanobliqueRSF
and includes novel features for interpretation of the oblique random survival forest (negation importance and ANOVA importance). I developed both packages.Yes.
None.