Open WilfHass opened 1 year ago
Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.
This was derived from the JOSE review checklist and the ROpenSci review checklist.
Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.
This was derived from the JOSE review checklist and the ROpenSci review checklist.
2.5 hours (including the time to write this review and build the code and run the analysis)
First of all, I really enjoyed reading your report and I think you did a great job. I have a few comments and suggestions that I hope will help you improve your report and overall git repository hygiene (see General Comments below)
This was derived from the JOSE review checklist and the ROpenSci review checklist.
total
is dropped because it is just merely the sum of other numeric features. Given other numeric features are highly correlated, is it possible to do classification just by it own, or include it as it is, which might end up with total
having a dominant coefficient. Some more clarification of why this column is dropped would make report even better. This was derived from the JOSE review checklist and the ROpenSci review checklist.
Add reason for choosing specific models (in report and README) (commit link)
Updating dependency list (commit link)
Add more information about hyperparameter tuning in report (commit link)
Show more EDA plots in the final report (commit link)
Add more details in the README.md about the features of the data (commit link)
Add reason why we chose accuracy as our scoring metric (commit link)
Add links to license, code of conduct (commit link)
Submitting authors: @WilfHass @carolinetang77 @missarah96 @vincentho32
Repository: https://github.com/UBC-MDS/pokemon-type-predictor Report link: https://github.com/UBC-MDS/pokemon-type-predictor/blob/main/doc/final_report.md Abstract/executive summary: In this project, we attempt to build a classification model using two algorithms: - Nearest Neighbours and a Support Vector Machine. We will use this classification model to classify a Pokemon's type (of which there are 18 possible types) based on the other stats (such as attack, defense, etc.) that it has. We use accuracy as the metric to score our models since there is no detriment to false positives or negatives, but we do want to know how many of the unknown Pokemon will be predicted correctly. On the unseen test data, the -NN model predicted 60% of the new Pokemon correctly while the SVC model predicted 67% correctly. Since these are not very accurate results, we recommend trying different estimators to fill up that Pokedex!
The data is found here. The data was cleaned by HansAnonymous and originally developed by simsketch. The original data can be found in the Pokemon database. All rights belong to their respective owners. Each row in the dataset contains a different Pokemon with various attributes. The attributes are measurements of the base Pokemon, such as attack, speed or defense.The different types of Pokemon are closely related to the other attributes it possesses. For example, a rock type Pokemon is likely to have higher defensive statistics (such as defense or health points) as well as rock-type abilities. It is also most likely to be coloured grey.
Editor: @flor14 Reviewer: Revathy ponnambalam, Ashwin Babu, Zilong Yi, Tony Zoght