The goal of the project is to investigate the links between various explanatory variables (information about passengers) and the outcome variable (survival).
Describe the data used or collected.
The original source of the data is said to be unknown; however, it is thought to have been derived from official inquiries launched in the UK and USA soon after the tragedy.
Unfortunately, there seem to be a lot of NA values.
Good pick, as realistically a good number of the variables will be related to the survival rate.
Describe how the research question will be answered, e.g. what approaches / methods will be used.
In the proposal there's mostly ggplot graphing.
Could add some linear regression modeling, perhaps accounting for the interaction between age and class; I'd think the rest wouldn't be that significant.
Is there anything that is unclear from the proposal?
I wasn't able to understand the last graph: is the weight the average fare price of those from a certain class who died/survived?
Provide constructive feedback on how the team might be able to improve their project.
It would be interesting to explore the survival rates of different floors and how the results could be biased, as it seems that those who died, who were probably on the lower floors, are more likely to have missing data.
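One way to check the missing-data concern above is to compare the share of missing values across groups. The sketch below is only an illustration in plain Python (the project itself uses R): the function name `missing_rate_by_group`, the column names `survived`, `pclass` and `age`, and the toy rows are all assumptions made up for this example, not the team's code or the real data.

```python
from collections import defaultdict


def missing_rate_by_group(rows, group_key, value_key):
    """Return {group: share of rows in that group where value_key is None}."""
    totals = defaultdict(int)
    missing = defaultdict(int)
    for row in rows:
        group = row[group_key]
        totals[group] += 1
        if row.get(value_key) is None:
            missing[group] += 1
    return {group: missing[group] / totals[group] for group in totals}


# Made-up toy rows for illustration only (NOT the real Titanic data).
toy_rows = [
    {"survived": 1, "pclass": 1, "age": 29.0},
    {"survived": 1, "pclass": 1, "age": 35.0},
    {"survived": 0, "pclass": 3, "age": None},
    {"survived": 0, "pclass": 3, "age": 22.0},
    {"survived": 0, "pclass": 3, "age": None},
    {"survived": 1, "pclass": 3, "age": 4.0},
]

# Share of missing ages among those who died (0) vs. survived (1),
# and among each passenger class.
by_survival = missing_rate_by_group(toy_rows, "survived", "age")
by_class = missing_rate_by_group(toy_rows, "pclass", "age")
```

If the missing share differs a lot between groups, dropping the NA rows would remove more of one group than another, which is the kind of bias worth mentioning in the presentation.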
What aspect of this project are you most interested in and would like to see highlighted in the presentation?
Survival by age. Could maybe look into the error margin; I would have thought children would be more likely to survive.
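As a rough idea of what looking into the error margin could involve, the sketch below computes a Wilson score interval for a survival proportion. It is written in plain Python purely for illustration (the project uses R), the function name `wilson_interval` is my own, and the counts passed in are hypothetical rather than taken from the data.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a proportion (z=1.96 is roughly 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half_width, centre + half_width


# Hypothetical counts: the same observed rate in a small and a large age group.
small_group = wilson_interval(5, 10)
large_group = wilson_interval(50, 100)
```

The smaller group gets a much wider interval, so if there are only a few children in the data, an unexpectedly low survival rate for them could simply reflect the small sample.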
Provide constructive feedback on any issues with file and/or code organization.
Organisation looked good.