jsoma / data-studio-projects

12 stars 18 forks source link

Beijing's air was notoriously bad a few years ago. How is it doing now? #153

Open ksliney opened 6 years ago

ksliney commented 6 years ago

Pitch

Are local government efforts to Improve Beijing's air quality successful? Let's compare 2008 through 2017.

Summary

I want to plot the air quality in Beijing, China from 2008-2017. (Hint: It used to be bad! I want to know if it still is.) My data came from the US Embassy.

Details

Data set(s): http://www.stateair.net/web/historical/1/1.html Code repositoryhttps://github.com/ksliney/data-studio/tree/master/code

Possible problems/fears/questions: I want to better contextualize what these air polution numbers mean, in terms of impact to health. Data for 2008 and 2017 is incomplete, so the chart is not as up-to-date as I would like.

Work so far

UPDATE: I reworked the data, and found the average air quality reading per day, and then sorted each day by its level of small particle pollution and danger to public health.

screen shot 2018-07-14 at 2 59 12 am

You can see the interactivity here: img_2312

I found the average monthly air quality reading for the year 2008, and then did this for every year, though 2017.

screen shot 2018-07-12 at 1 43 30 pm

Older work:

screen shot 2018-07-12 at 10 55 19 am screen shot 2018-07-11 at 12 33 20 pm

UPLOAD SCREENSHOTS

Checklist

This checklist must be completed before you submit your draft.

ksliney commented 6 years ago

UPDATE: Is Beijing's Air Quality Really Improving?

screen shot 2018-07-16 at 12 41 19 pm

Any changes in direction or topic?

I re-worked the data to average AQI readings for each day (rather than monthly), for 2008 through 2017. I made the chart using google charts, which is interactive, though the static screenshot (hopefully) tells the same story just as well.

Problems/Questions

*When cleaning the data, I dropped any values in the dataset where the AQI detector errored and returned a negative reading. I noticed that many of the AQI reading errors I dropped were sampled during the winter months, when coal consumption is high and pollution is typically at its worst in Beijing. So the updated chart may misrepresent pollution levels due to lack of data and reliance on one detector for AQI values.

*If I were to re-work the data again, I would plot the AQI at a certain time of day (say, 8:30 AM, or 5:30PM) instead of plotting the daily average.

Checklist

xeophin commented 6 years ago
jsoma commented 6 years ago
playfairbot commented 6 years ago

Greetings! I'm a little robot, beep beep boop boop.

You need some feedback, let me summon @jessimckenzi, @Yuanqi-Hong, @Katerinavts for you

It looks like we need to fix up your your update a little bit! Edit it by clicking the pencil in the top right-hand corner. It requires:

Maybe you just didn't use the template? If not, edit your comment, cut and paste the template in, and then fill it out.

jessimckenzi commented 6 years ago

This is fascinating! I do think the use of blue/grey is a very effective use of color for storytelling, but agree with Soma that the lines between the colors should go. I also find the grid lines distracting, and would prefer the graph without.

But the content is really interesting! Would love to know why the info about 2017 isn't available—is it just too soon to have been released? Last thought: If I'm not a very data literate reader, I might not know what to make of the graph, and you don't explain. Maybe the title should explain the graph: "Beijing's air quality improved between 2014 and 2016" or similar

Katerinavts commented 6 years ago

Good work! My only suggestion would be to narrow down the legend to 3 or 4 categories of air quality.

Yuanqi-Hong commented 6 years ago

I'm honestly interested in this pitch. I lived in Beijing (2014 - 2018), and the data for 2017 is really counterintuitive for me! People (classmates and colleagues) in Beijing were talking about how 2017 was a better year in terms of air quality compared to previous ones. I googled the topic and articles like this BBC story came up. I'm curious to know why the discrepancy.

ksliney commented 6 years ago

UPDATE 2: https://ksliney.github.io/airpolution/

screen shot 2018-07-25 at 3 25 50 pm

UPDATE: Lots of time spent researching solutions to recommendations, and a few changes made

screen shot 2018-07-22 at 3 45 51 pm

Here's an updated link to my notebook and data.

Any changes in direction or topic?

It turns out, I was interpreting the data incorrectly. The story actually is a bit bleaker than I realized for small particle pollution only. Because of this, you'll notice the number of blue "good" days has gone down dramatically in this later draft.

Problems/Questions

Unfortunately I am not yet able to answer my question regarding 2017. The US embassy's data for 2017 is (1) simply not present for all months (i.e., those months are not included in the data set), and (2) for the months that are present, there are days within those months that returned error readings that I had to drop from the data. So there's too much missing from 2017 to draw any big conclusions.

I hear all of you on there being too many color categories. These correspond to the EPA's own categories, but I realize that doesn't mean terribly much to a reader. I'm trying to figure out how much to convey, without simplifying too much or recreating the same table the EPA puts out. So that will be next.

Checklist

collleenwang commented 6 years ago

Excellent work! To make the y axis to see what happened in 365 days is very good idea! I think the blue part can be more brighter, so that we can see more comparable results.

jlstro commented 6 years ago

Hey, I know that all our instructors say that we shouldn't rely on readers interacting much, but I still like it a lot when something moves and changes and acts like it was interactive :-) Coll that you got it to work. There are a few minor things I'd do differently with regard to your final page (even though it is too late now, I guess?): It looks like the diagram does not align correctly, so it looks a bit weird postion-wise. Also, I have no idea what PM 2.5 pollution levels according to EPA standards means. I assume that each segment of each year is a sum of the days that fall into the respective category, but other than that, a subheadline explanation would help a lot. Your color scheme is cool, it really reminds me of dirty, dirty air.