jsoma / data-studio-projects

12 stars 18 forks source link

[Project] Is Japan a safe country? #129

Open kidaemon opened 6 years ago

kidaemon commented 6 years ago

Please complete all of the following sections, or the ghost of Joseph Pulitzer will spookily dance around your issue! A completed version of this template can be found at https://github.com/kidaemon/data-studio/tree/master/code/01-disasters

Pitch

Summary

I want make a chart about natural disasters in Japan.

My data came from Fire and Disaster Management Agency

I want my image to look like this:

Details

Possible headline(s):

Data set(s): http://www.fdma.go.jp/html/hakusho/h29/h29/html/shiryo1-5-1.html

Code repository: https://github.com/kidaemon/data-studio/tree/master/code/01-disasters

Possible problems/fears/questions:

Work so far

I played with the data a little bit, and I made this image.

image

image

Checklist

This checklist must be completed before you submit your draft.

Weihua4455 commented 6 years ago

This project looks really interesting! I see one of the things you played around with is the y-axis: what should the measurements be and what should the gap be. set_yticks could be your new best friend here. Also once you are done graphing the data, it will be informative to provide some context: what was the worst disaster, did it spark any policy changes, etc.

Also just a random thought, it would be cool to give each type of disaster a color and stack them in the bar graph, something like this: stacked-bar-with-pandas

kidaemon commented 6 years ago

Update

Any changes in direction or topic?

Because the original data set only have data for disasters with more than 100 victims, I changed data set to another source. The data from Japan's Cabinet Office only covers until 2017 and after that, I scraped the website of Japan's Fire and Disaster Management Agency.

The death toll of floods looks like this. image

And the below is that for earthquakes. image

Combining with the scraped result, the death toll of floods looks like this. image

Problems/Questions

I spent so much time on scraping and data cleaning, especially regex application for Japanese language. It was a very good experience for me to be familiar with Tabula, pdf-to-text.

But, I'm running late on visualization part. Next couple of days, I'll focus on visualization and complete my work until Friday.

Checklist

SiruiZhu commented 6 years ago

The whole scraping and Dataframe looks great! It might be better to use a line graph? and some background text.

castorsia commented 6 years ago

Great work scrapping and regexing all these complicated data, and also, very interesting research question!

What I would like to see:

-How many floods/earthquakes happened by year? I need more overall context, it would help a great deal!

Overall, great work!

playfairbot commented 6 years ago

Greetings! I'm a little robot, here for a surprise inspection.

You need some feedback, let me summon @zle2105 for you

It looks like we need to fix up your your update a little bit! Edit it by clicking the pencil in the top right-hand corner. It requires:

Maybe you just didn't use the template? If not, edit your comment, cut and paste the template in, and then fill it out.

kidaemon commented 6 years ago

Final

Project visuals/text

In Japan, natural disasters are becoming the imminent threat for safety. The death toll from the natural disasters already exceeded 300 this year and it is the most disastrous in decades. The myth of Japan's safety is in dager.

graph-disasters

graph-earthquakes

graph-murders

Details

Headline:

Natural disasters are imminent threat for Japan's safety

Published website version: https://kidaemon.github.io/project-01/index.html

Code repository:

https://github.com/kidaemon/data-studio/tree/master/code/01-disasters

Final data set(s):

Disaster Management, Cabinet Office http://www.bousai.go.jp/kaigirep/hakusho/h30.html

Fire and Disaster Management Agency http://www.fdma.go.jp/bn/2018/

What did you find to be the most difficult part of this project?

Applying regular expression on Japanese language.

Are you satisfied with what you produced? Is there anything you would like to change or improve?

I'm not satisfied with the result. But, I learned how to use regex on Japanese somewhat. Next time, I'd like to plot the result on map and figure out what measures are needed in each area.

Checklist

castorsia commented 6 years ago

Absolute respect for scrapping and regexing so many data. I really like your graphs. I would consider adding y labels in your last two graphs, in order to be easier to understand what they are about without a header. Great work!

sarahslo commented 6 years ago

this is an interesting question you are asking of the data. and good for you that you have withstood all the scraping. i see why you made a separate chart for earthquakes. i feel as if these want to be a series of separate charts, organized by smallest to largest stacked on top of each other. it's hard to read a stacked bar chart like this because most of the data is not at the zero baseline.

you could even do bubble charts/graduated circles here as time series... i'd do something other than the stacked bar.

also, murder and natural disasters are not comparable. i'd drop that data unless you want to keep going and add accidents or something else.

xeophin commented 6 years ago

Since you already posted the final update, you might not continue working on that, but in case you do: maybe you could separate the natural disasters in two groups, one with "seismic" causes (i.e. volcanoes and earthquakes) and another with meteorological causes (not sure if the "other" category belongs in there, too) – as this might give an idea if there is some trend: is the ground becoming more unstable? Does climate change have an influence here? Could be an interesting follow-up question.

jlstro commented 6 years ago

Hey, looking at your final page I find it a bit hard to understand what's going on there. I think the problem is the black background color, which makes parts of the graphs' descriptions unreadable...

Really cool though that you apparently managed to use Regex with Japanese characters for this!