Closed howardbaek closed 11 months ago
:wave: @howardbaek, thanks for your submission!
We're wondering about overlap (https://devguide.ropensci.org/policies.html#overlap) with the audio.whisper package, that seems easier to install since it doesn't require any external software (https://ropensci.org/blog/2021/09/13/system-calls-r-package/#downsides-of-system-commands). Could you please comment on that?
I didn't know about those policies. I can close this issue.
Submitting Author Name: Howard Baek Submitting Author Github Handle: !--author1-->@howardbaek<!--end-author1-- Repository: https://github.com/howardbaek/audrey Submission type: Pre-submission Language: en
Scope
Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check an appropriate box below):
Data Lifecycle Packages
[x] data retrieval
[ ] data extraction
[ ] data munging
[ ] data deposition
[ ] data validation and testing
[ ] workflow automation
[ ] version control
[ ] citation management and bibliometrics
[x] scientific software wrappers
[ ] field and lab reproducibility tools
[ ] database software bindings
[ ] geospatial data
[ ] text analysis
Statistical Packages
[ ] Bayesian and Monte Carlo Routines
[ ] Dimensionality Reduction, Clustering, and Unsupervised Learning
[ ] Machine Learning
[ ] Regression and Supervised Learning
[ ] Exploratory Data Analysis (EDA) and Summary Statistics
[ ] Spatial Analyses
[ ] Time Series Analyses
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
This package interacts with the OpenAI Whisper by running
system2()
. Not sure if this qualifies as scientific software wrapper or data retrieval?Anyone who wishes to use OpenAI's Whisper model to perform automatic speech recognition and translation into English.
This package, https://github.com/bnosac/audio.whisper, does the same thing and is an Rcpp wrapper around the whisper.cpp C++ library. This package is based on a CPU-only inference engine written in C++11, so no external software is needed and you can directly install and use it from R.
My package does require external software to be installed (Whisper + ffmpeg), but doesn't have any Rcpp and simply uses
system2()
to access OpenAI's Whisper model.pkgcheck()
keeps on telling me to "avoid long code lines, it is bad for readability.", but I'm not sure how to make code lines shorter automatically.Also, I'm not sure how to fix these
pkgcheck()
messages: