JeroenKools / covid19

Data visualizations of the spread of the 2019 novel Coronavirus (COVID-19), based on data from Johns Hopkins University.
MIT License
10 stars 5 forks source link
coronavirus data-visualization jupyter-notebook

FAQ

I used this data from Johns Hopkins University Center for Systems Science and Engineering, who compiled it from data from the World Health Organization, the Chinese CDC, the US CDC and other organizations.

Of course, there could already be a lot more cases going under the radar, in people who are not being tested.

A logarithmic Y-axis makes sense for displaying quantities that can be expected to grow exponentially, i.e. with a similar percentage, over time. It allows you to judge relative growth both at the start and later on. This is unlike linear graphs where small variations early on end up at such a small scale as to be completely unnoticeable.

In addition, this allows for better comparison between different series (i.e. countries) that are at different stages. In a linear graph you can't show both a series that is at 100,000 and another at 100 and have both convey meaningful information.

That said, it's easy enough to change this plot to be linear. I included an example in the notebook.

I didn't want to graph a hundred countries in one plot, that would be too messy. The easiest way to customize the displayed set of countries is opening it in Google Colab, and change the list of countries to include in the interactive plot.

Just some of the countries with the highest number of confirmed cases, plus some I have a personal connection to.

No. I'm not passing judgment on anyone, or trying to tell anyone to do anything, I'm just looking at data.