Open ttimbers opened 7 months ago
My suggestions would be:
Find a way to specify input type constraint for the script. In python we were able to do it by for example: @click.option('--test_data_path', help='path of test set data (csv) to read', type=str) I am not sure if you would be able to do the same in R but it is a good way to help reproducability.
If there is no way to specify input type, I would recommand try to add a test for input type check
Can consider to let users to change hyperparameters tuning. for example, let useres choose number of fold for cross validation as input argument of the script.
This was derived from the JOSE review checklist and the ROpenSci review checklist.
My suggestions are:
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Submitting authors: Sid Ahuja, Zackarya Hamza, Alexander Dawson
Repository: https://github.com/DSCI-310-2024/DSCI-310-Group-18_wine-quality-predictor/releases/tag/version2.0.0
Abstract/executive summary:
In this project, we build a prediction model using the k-nearest neighbours algorithm which attempts to categorize the quality of a wine based on its' physiochemical properties. We classify wine quality into a binary category: whether it is good or bad. Our classifier performed moderately well on the test set, but further research must be done to improve the model before it is put into production.
The dataset that we used for this project is about white variants of the Portugese "Vinho Verde" wine, which was assembled by Paulo Cortez, A. Cerdeira, F. Almeida, T.Matos, and J.Reis. The dataset was sourced from UCI Machine Learning Repository (Dua and Graff 2017), located here. Each row in this dataset showcases an observation of a white wine, specifically related to its physicochemical and sensory attributes.
Editor: @ttimbers
Reviewer: Zhibek Dzhunusova, Prithvi Sureka, Peter Chen