Open eyrexh opened 1 year ago
Thinks I liked a lot:
Suggested Improvements:
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Good job!
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.
Things that were good about the project:
Suggestions for improvement:
/src/
for quick reference.This was derived from the JOSE review checklist and the ROpenSci review checklist.
Reviewer: dhruvinishar
Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.
model_svr.py
andrating_eda.py
could be abstracted into smaller functions instead of having it all in the main() function. This would help with a more structured way of styling the scripts.This was derived from the JOSE review checklist and the ROpenSci review checklist.
kableExtra
in the installation section, along with other packages: https://github.com/UBC-MDS/chocolate_rating/commit/9a9b8748070f6a80ece363ae854634ce1b90d0e4pandoc
according to TA feedback: https://github.com/UBC-MDS/chocolate_rating/commit/599b09c18cfaab55f36e0145c5b828bf74599a30docopt
to dependencies: https://github.com/UBC-MDS/chocolate_rating/commit/48137d6110e59bf657e348a33f921a8d09bc784d
Submitting authors: @markusnam @robindhillon1 @eyrexh @LishaGG
Repository: https://github.com/UBC-MDS/chocolate_rating Report link: https://github.com/UBC-MDS/chocolate_rating/blob/main/doc/chocolate_rating.html Abstract/executive summary: Here we attempt to build a chocolate numeric rating prediction model by evaluating Support Vector Regression (SVR) and Ridge models on chocolate-related data such as manufacturers, country of bean origin, cocoa percentage and most memorable characteristics. Our best model (SVR) performs fairly well on an unseen test data set. The mean absolute percentage error (MAPE) of SVR is 7.99% compares with 8.22% of the Ridge model. From examining the coefficients generated from the Ridge model, we found that the “raspberry” flavour characteristic and “Fruition” chocolate manufacturer have the highest positive coefficients, while the “medicinal” and “chemical” flavour characteristics have the lowest negative coefficients. The data set used in this project was compiled by Brady Brelinsk Manhattan Chocolate Society, and can be sourced here. Each row in the data set represents an observation of a chocolate product with information like manufacturer, company location, review date, country of bean origin, specific bean origin or bar name, cocoa percent, ingredients, most memorable characteristics and rating.
Editor: @flor14 Reviewer: Spencer Gerlach, Austin Shih, Dhruvi Nishar, Alexander Taciiuk