DSCI-310-2024 / data-analysis-review-2024

2 stars 0 forks source link

Submission: Group 18: Wine Quality Predictor #18

Open ttimbers opened 6 months ago

ttimbers commented 6 months ago

Submitting authors: Sid Ahuja, Zackarya Hamza, Alexander Dawson

Repository: https://github.com/DSCI-310-2024/DSCI-310-Group-18_wine-quality-predictor/releases/tag/version2.0.0

Abstract/executive summary:

In this project, we build a prediction model using the k-nearest neighbours algorithm which attempts to categorize the quality of a wine based on its' physiochemical properties. We classify wine quality into a binary category: whether it is good or bad. Our classifier performed moderately well on the test set, but further research must be done to improve the model before it is put into production.

The dataset that we used for this project is about white variants of the Portugese "Vinho Verde" wine, which was assembled by Paulo Cortez, A. Cerdeira, F. Almeida, T.Matos, and J.Reis. The dataset was sourced from UCI Machine Learning Repository (Dua and Graff 2017), located here. Each row in this dataset showcases an observation of a white wine, specifically related to its physicochemical and sensory attributes.

Editor: @ttimbers

Reviewer: Zhibek Dzhunusova, Prithvi Sureka, Peter Chen

petercmh01 commented 5 months ago

Data analysis review checklist

Reviewer:

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1.5

Review Comments:

My suggestions would be:

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

zhibekD commented 5 months ago

Data analysis review checklist

Reviewer: zhibekD

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 2

Review Comments:

My suggestions are:

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.