The following is the peer review of the project proposal by [name of team completing peer review]. The team members that participated in this review are
[Ryan Hu] - @huryan
[Chandler Naylon] - @cnaylon
[Aryan Poonacha] - @Aryan-Poonacha
[Federico Arboleda] - @fedarboleda
Describe the goal of the project.
The goal of the project is two-fold: first, to see the relationship between performing poorly in the competition and the number of episodes they appear in, and second, to examine the relationship between the number of bakers that get eliminated and the episode's viewership.
Describe the data used or collected.
The data is from TidyTuesday's October 25th challenge under the bakeoff package on the Great British Bake-off, a TV show aired in British and French. It is a little unclear the exact definition and context of the show, as the proposal mainly discusses the data and not what the show actually is. The dataset includes four dataframes with data on bakers, episodes, challenges, and ratings. These will be used in conjunction to answer the research questions, along with data manipulation to create the necessary variable age_generation.
Describe how the research question will be answered, e.g. what approaches / methods will be used.
The first research question will be answered through a scatterplot of number of episodes appeared in and whether they were bottom three, faceted by season and with a trendline. It is a little unclear what the variables are defined as though, as there is nothing in the read.me file, where the codebook should be. The second question will be answered through boxplots of number of bakers eliminated for both 7-day and 28-day viewership, faceted by season. The wording here is a little confusing when it says "for each number of bakers eliminated", as the each makes it hard to interpret.
Is there anything that is unclear from the proposal?
See what was mentioned previously.
Provide constructive feedback on how the team might be able to improve their project.
The instructions say that two separate plots are required per question. It is a little unclear what the second plots would be for each question.
What aspect of this project are you most interested in and would like to see highlighted in the presentation.
It would be interesting to see how age and generation relates to episodes appeared in and whether they were in the bottom three.
Provide constructive feedback on any issues with file and/or code organization.
As mentioned prior, it would be useful to have a clearer description of the variables. Also, clarifying whether any outside data would be merged or used could help also.
The following is the peer review of the project proposal by [name of team completing peer review]. The team members that participated in this review are
[Ryan Hu] - @huryan
[Chandler Naylon] - @cnaylon
[Aryan Poonacha] - @Aryan-Poonacha
[Federico Arboleda] - @fedarboleda
Describe the goal of the project.
The goal of the project is two-fold: first, to see the relationship between performing poorly in the competition and the number of episodes they appear in, and second, to examine the relationship between the number of bakers that get eliminated and the episode's viewership.
The data is from TidyTuesday's October 25th challenge under the bakeoff package on the Great British Bake-off, a TV show aired in British and French. It is a little unclear the exact definition and context of the show, as the proposal mainly discusses the data and not what the show actually is. The dataset includes four dataframes with data on bakers, episodes, challenges, and ratings. These will be used in conjunction to answer the research questions, along with data manipulation to create the necessary variable age_generation.
The first research question will be answered through a scatterplot of number of episodes appeared in and whether they were bottom three, faceted by season and with a trendline. It is a little unclear what the variables are defined as though, as there is nothing in the read.me file, where the codebook should be. The second question will be answered through boxplots of number of bakers eliminated for both 7-day and 28-day viewership, faceted by season. The wording here is a little confusing when it says "for each number of bakers eliminated", as the each makes it hard to interpret.
See what was mentioned previously.
The instructions say that two separate plots are required per question. It is a little unclear what the second plots would be for each question.
It would be interesting to see how age and generation relates to episodes appeared in and whether they were in the bottom three.
As mentioned prior, it would be useful to have a clearer description of the variables. Also, clarifying whether any outside data would be merged or used could help also.