Open NickSadjoli opened 4 years ago
So far, we've determined that Population, Population Density, and Age Median seem to have low correlation with Confirmed Cases and Fatalities, and thus don't appear to affect them much.
However, given the recent news and observations on how some countries have managed to curb the rise in cases and fatalities, it seems the following should be considered and might be the features we'll need to look into:
I'd say this is fairly obvious data to consider, since more tests conducted would mean more confirmed cases being reported daily.
However, I'd also note that if we use such data, there are a few caveats to keep in mind, as can be seen below (IMO):
I'll read up and watch some more videos on the COVID-19 spread to get a better idea of which features we should look for after this, and hopefully I'll find something good.
What are your thoughts on this, @josephinemonica? Any opinions or other suggestions would be very welcome and much appreciated.
EDIT: Formatting
To consolidate the types of data and sources we need to consider, based on discussions with @josephinemonica:
Default features to use and check:
Features to check correlation:
Other considerations based on the TED talk with Bill Gates:
We need additional data sources beyond the ones listed on Kaggle. The only other source so far is the Worldometer site, but additional sources with other types of data would be most welcome.
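For the correlation checks mentioned above, here is a minimal sketch of how a candidate feature could be screened against a target such as Confirmed Cases. It is only an illustration, not the code we actually ran: the helper names `pearson` and `rank_features`, the feature names, and all the numbers are made-up placeholders, and Pearson correlation is just one assumed choice of measure.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A constant column has no defined correlation.
        return float("nan")
    return cov / sqrt(var_x * var_y)


def rank_features(features, target):
    """Return (feature name, correlation) pairs, strongest |r| first."""
    scores = {name: pearson(values, target) for name, values in features.items()}
    return sorted(scores.items(), key=lambda kv: abs(kv[1]), reverse=True)


# Placeholder values only, one entry per country; NOT real data.
features = {
    "tests_conducted": [10, 20, 30, 40, 50],
    "median_age": [30, 45, 28, 41, 35],
}
confirmed_cases = [1, 2, 3, 4, 6]

ranked = rank_features(features, confirmed_cases)
for name, r in ranked:
    print(f"{name}: r = {r:.3f}")
```

Features with |r| close to 0 would be candidates to drop, while those closer to 1 would be worth keeping for a closer look. Keep in mind that correlation only captures linear association, so a low score alone does not prove a feature is irrelevant.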