jsoma / data-studio-projects

12 stars 18 forks source link

Tracking forest fires in Europe #174

Open castorsia opened 6 years ago

castorsia commented 6 years ago

Please complete all of the following sections, or the ghost of Joseph Pulitzer will spookily dance around your issue! A completed version of this template can be found at https://github.com/jsoma/data-studio-projects/issues/1

Pitch

My intent: to track (forest) fires in the European Union and in Greece, specifically. For that, I use data from the European Forest Fire Information System EFFIS. I want to find out: the countries most affected, recent trends for the Mediterranean, compare severity through burned areas.

Summary

Details

**Possible headline(s): Fire season is already here: A look at the data

**Data set(s): At the moment, 3 datasets are used, featuring: 1) The number of fires for 2017, for every country 2) The yearly number of fires per country since 1980 3) The total burned areas per country per year

Code repository: (https://github.com/castorsia/data-studio/commit/3bf4df5063addd8312e96001d7ce277bd362de2b)

Possible problems/fears/questions:

I tried using a personal style that I have created... With tragic results in the x axis.

Work so far

Cleaned up my datasets, dropped non-EU countries, investigated the huge amount of Ukraine and Russia fires, and then started a lacklustre plotting..

10 most impacted countries for 2017

download

Checklist

This checklist must be completed before you submit your draft.

jsoma commented 6 years ago
castorsia commented 6 years ago

Update

Your project content: images/words/etc

2018-07-20 00_49_25-project_2_v_0-copy1

Something terrible must have happened to Portugal in 2017. It was this: (https://www.bbc.com/news/world-europe-41634125)

elaaaa_1

Any changes in direction or topic?

-Focused on Portugal and the fires that ripped the country apart. -Added heading and subhead -Changed the colours -Different font -dropped the tick marks

Problems/Questions

Can not get a specific column and save it as a new dataframe (say, Portugal, featuring years and burnt areas). I get an out of index error. Have to go around it... Get two dfs and concat them. -Dead space in the second line graph. I want to loose it!

Code: (https://github.com/castorsia/data-studio/commit/e10a0a663fc538d410e492af4373e793afa4a68a)

Checklist

playfairbot commented 6 years ago

Greetings! I'm a little robot, checking in on your project.

You need some feedback, let me summon @cfelke, @dz2383, @vpenney for you

vpenney commented 6 years ago

You can actually omit the parenthesis when you list the number of hectares that burned in your bar chart--I initially read that as "ha" as in "haha," but forest fires aren't very funny. Alternatively, since you define the measure in the title, you can omit "ha" in your bars entirely, or use a simple legend like you use in your line graph.

Your line graph could be an interesting opportunity to use some of that blank space in the graph to explain the peaks and troughs in the graph (like what Bui talked about). That would probably come later, once you get everything into Illustrator, but it would be interesting to know if those spikes follow a drought cycle or if there were big fires in those peak years that drove up the number of burned acres.

cfelke commented 6 years ago

Wow, what's going on in Portugal? I really like your second graph. It would be great to have some annotations in your line graph, especially for 2003(?) and 2005 (are the peaks related to the same reason? If so, you could make one text field with two arrows pointing at both years) and definitely for 2017. Regarding the empty space you're not satisfied with, you can fill it up (Angela did this in her first project e.g. https://github.com/jsoma/data-studio-projects/issues/138)

Regarding the first graph, if you're already displaying the 5 countries with the biggest losses of forests I don't think you need to put this in your title as well. I'd ditch the country abbreviations on your y axis and choose a brighter colour for your values, maybe orange (= fire, wooh!), in fact anything but black. I'm not sure if you need to put the ha next to your values, if you're already labeled your x-axis with ha and it's in the title too. Maybe ditch one of those. Moreover, some x grid lines for orientation might be helpful, give it a try.

dz2383 commented 6 years ago

I love the topic! For improvements: could you get rid of the black lines in two graphs( bottom and right side)? And add grid at the background for the line graph? Because now it's hard to read the number.

castorsia commented 6 years ago

Update

Your project content: images/words/etc

I struggled to find a good angle in approaching my data, because, for example, forest hectares burnt, is just a number. How it can be translated to patterns? Are 29.000 hectares of forest burned, not important, compared with bigger fires? So, I took a jab in providing some more graphs with info. STILL TO DO: Context for Portugal and Greece and Iberian wildfire spikes.

2018-07-27 01_42_06-project_2_rev_2 2018-07-27 01_57_33-project_2_rev_2 2018-07-27 01_58_08-project_2_rev_2 2018-07-27 01_58_30-project_2_rev_2

Any changes in direction or topic?

There's a real need to look into country-specific conflagrations-and their reasons.

Problems/Questions

-I must find more creative ways to communicate the data. I have the same graphs over and over again. -Imperative to find a way to visualise the scale of the forest fire problem. -I will compare Greek fire service data with the European Union.

Checklist

mattrehbein commented 6 years ago

Personally, I think simply showing which countries have experienced/are experiencing a greater frequency in fires is interesting. That information certainly begs the question of why for the individual countries that have seen great increases, but getting into why may be a lot for the scope of this project. Are you looking for drought data or something similar if you're trying to get to the cause?

This may be a silly question but just to be clear about your terminology: Your focus is forest fires -- so we're only looking at fires in wooded natural areas? Is this the same as wildfires? Do they threaten inhabited areas, destroy any buildings or cause fatalities?

Practical, nit-picky things about the graphs that you're probably already working to fix:

maxarvid commented 6 years ago

Really interesting project, and something that will only become more important as Europe experiences ever increasing heat waves.

Some quick notes:

julialedur commented 6 years ago

Cool project! As improvements, I would:

I'm looking forward to seeing your next version! :)

castorsia commented 6 years ago

Update

Your project content: images/words/etc

The Iberian peninsula is reportedly about to break the European heat wave record, with temperatures expected to touch 50 degrees Celsius, while thousands have been mobilised to avert fires. Greece was ravaged by wildfires with the highest death toll in its history, more than 90 people. So, what worst time for an update.... Forest fires, to clear things up RE terminology, are fires that begin in forests. They expand to other types of terrains, in crops, houses, suburbs, villages etc. So, they (can) cause causalities.

Initially, I made the first graph easier to the eye.
1_graph-bar-5-largest_areas_burnt

And provided a bit of context for what is going on with Portugal.

2_graph-line-portugal-burnt-hectares-per-year

The patterns are quite strong, as we can see.

3_graph-line-top-5-fires-per-year

A closer look at Greece.

4_graph-bar-greece-burnt-hectares-per-year

Since we are speaking about Greece, I encountered the following problem: each summer the country is going up in flames, but due to its considerably smaller size than other countries in the graphs, the scale of the issue is not easy yo convey. 50.000 burnt hectares for Greece is huge, but for Spain, which is many times bigger than Greece... It is not.

So, I grabbed historic forestry data from the EU about the total forest cover of each country in 1990, summed up the burnt areas since 1990 until last year, and calculated what percentage of the 1990 forests, are the burnt areas.

I used pywaffle, which I don't really know how to use... Evidently. But it is a place to start.

spain_1990 cypr_1990 italy_1990 greece_1990 port_1990

Any changes in direction or topic?

I had to move away from absolute values to relative (percentages).

Code

(https://github.com/castorsia/data-studio/blob/master/code/project_2/Draft_1/Project_2.ipynb)

Problems/Questions

-I need to find a way to better visualise the percentage of lost forests. Ideas, and most importantly, ways that it can be done?

Delete this line and complain about any problems you've run into (solved or unsolved)

Tables, tables, tables! CSVs in different orientations, indexes, I had to manually change things.

Checklist