Open gfairbro opened 2 years ago
Couldn't find Victor or Linh Giang's github handle!
@gn385x is Linh Giang but i cannot assign her.
Overall the project is executed well and there is a good flow to the report. It is concise and summarize the project well. The report clearly states the objective, analysis, methodology used for modelling, results as well as limitations. Awesome job on building the jupyter book for final report.
A few suggestions:
This was derived from the JOSE review checklist and the ROpenSci review checklist.
What was done well:
What could be improved:
Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Overall, I think this is great work - kudos team!
Generally, an outstanding project you got here! Well-done.
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Very nicely done! Your group really went above and beyond the minimum requirements and weren’t afraid to use more complex tools. I found that your ideas were communicated effectively in the written sections and code was well documented and easy to read. I can see that you have already incorporated much of the feedback received from other reviewers and it was not easy to find areas for improvement.
Stood out
Areas for improvement
report summary
given that there is another file in the wine_quality_predictor_report with the exact same name. You may want to change the name of your report to something that clearly marks it as the full and final version.This was derived from the JOSE review checklist and the ROpenSci review checklist.
Hello group9! Yesterday I spend some time with all the groups present in lab1 providing some suggestions on how to improve the report. I think you were online, so I leave here some minimal comments:
1. Comments from Kylemaj on CONTRIBUTING.md file
Commit addressing the comments: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/9dc54f947b000ef3cc924db09d0b415a9d7396a6
2. Comments from Kylemaj, gn385x and Vikiano on juputer notebook structure
Commits addressing the comments: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/c79a016dd9e0cc0a0f5210b4c4b0810d04d328c4 https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/7405529ead6b01df4c60a4cc17b817c245e0cb12
3. Comments from manju-abhinandana on file organization
Commits addressing the comments: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/698e791add9e15a47b180ec345971a26d6e9b667 https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/13767b692661ab5085bc27ea20accfd3a19e423b https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/fc20795a0f8b854a7d7522cc618553a4b4704e33
4. Comments from manju-abhinandana, Vikiano, and Kylemaj on contents of report
Commits addressing the comments: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/57d23b2b8799b07cbf043ad71c7d012946cf05da https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/7405529ead6b01df4c60a4cc17b817c245e0cb12 https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/1b6b91ad6cbe26bf3c35b3a790c708439ac68fbb https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/bd50e1aacf21e4fe547866ac7081b953ae83d245 https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/d68aff8b25098345aaef4d6fa0039e67bdc04be7
5. Addressing TA's feedback on Milestone 2 release:
try
, except
everywhere instead of checking if a file exists when outputting (and maybe reading) the files. You could instead check if the file exists and create it if it doesn't. Using try
except
in production except for when dealing with 3rd party stuff is not a good practice.Commits addressing the comments: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/commit/2dee29a5c4941d463f0664748b65ac28a579ea51
if
statement to check if a file exists before exporting it, and we adopts for
loops to make our codes DRY.
Submitting authors: gfairbro, paradise1260, Luming-ubc, GWYY
Repository: https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor Report link: https://htmlpreview.github.io/?https://github.com/UBC-MDS/DSCI_522_group09_Wine_Quality_Predictor/blob/main/reports/wine_quality_predictor_report/_build/html/report_summary.html Abstract/executive summary: Wine is a product that is both an extremely popular and highly consumed product, and one that can be very expensive to buy and lucrative to sell. It is also sold at much higher variety levels than almost any other consumer product - in some supermarkets well over 1000 different wines are stocked.Lockshin, 2003
At the same time, it is also one of the hardest to identify quality ahead of purchase, since you must consume it to decide. The level of quality a consumer might require can even vary wildly depending on the consumption occasion. P. G. Quester and others.
The quality of wine however is difficult to evaluate objectively and is reliant on some very subjective sensory elements. However we believe that this question can be answered by evaluating which physicochemical features are important in determining the quality score of a wine, the wine manufacturers can refine certain wine-making procedures that may yield wines with "promising" properties.
We also believe that by using a quality score that is a human taste output (i.e. each quality score is a median taken over a minimum of 3 sensory assessors) instead of following an objective and rigid standard, which makes wine certification a complicated task, we can better capture the inherent subjectivity of the task. Therefore, attempting to unravel the relationship between physicochemical properties and human taste sensations may also be a direction in the wine certification field Cortez and Others
The data sets were sampled from the red and white vinho verde wines from the North of Portugal, created by P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis (2009). The data sets were sourced from the UC Irvine Machine Learning Repository and can be found here. One data set is for the red wine, and the other is for the white wine, and both data sets have the same features and target columns. Each row represents a wine sample with its physicochemical properties such as fixed acidity, volatile acidity, etc. The target is a score (integer) ranging from 0 (very bad) to 10 (excellent) that represents the quality of the wine.
Editor: @flor14 Reviewer: Maj_Kyle, Neervaram Abhinandana Kumar_Manju, Nguyen_Jiang, Francis_Victor