jsoma / data-studio-projects

12 stars 18 forks source link

National Anthems - a comparison [Project] #28

Closed peterjost closed 6 years ago

peterjost commented 7 years ago

Pitch

Summary

On August 1st the Swiss celebrate their National Day. As in many other countries the National Day is THE DAY for playing the National Anthem. Before it starts I want to explore and to compare the National Anthems of Europe in terms of topic, word frequency, length, music style and more.

Type of visualization: word clouds, bars, mapping. Audio and video on the website would be a plus.

Details

Possible headline(s):
National Anthems - keep it short and violent // Violent French, fraternized Italians, devout Swiss

Data set(s): Lyrics and music files: http://www.laenderservice.de/hymnen/ Lyrics and additional information: https://www.bpb.de/system/files/dokument_pdf/4808b_hymnen-booklet_i_140429.pdf Music files (MP3): http://www.bpb.de/internationales/europa/europaeische-union/171039/die-nationalhymnen-der-28-eu-mitgliedstaaten

Code repository: https://github.com/peterjost/studio-projects

Possible problems/fears/questions:

Work so far

First examples of National Anthems word clouds.

Switzerland: image

Germany: image

Checklist

This checklist must be completed before you submit your draft.

simon-pinkmartini commented 7 years ago

Nice Idea! Some people think bad of wordclouds but in this case it's a very powerful tool to summarize the data and bring across a couple of good points. Also, I like the idea of coloring the charts in the respective flag colors of a country. Can you compare groups of countries against one another (eg european vs asian countries)?

playfairbot commented 7 years ago

Greetings! This is the Playfair Bot, just having a little look around.

Please post your project draft! It should be posted by Friday. More details available here. If you posted one but I'm not seeing it, make sure you followed the template.

It looks like we need to fix up your pitch a little bit! Edit it by clicking the pencil in the top right-hand corner. It requires:

If you see a lot of errors, make sure you used the templates correctly!

chrgavalas commented 7 years ago

I really like the fact that it's a simple idea but still very informative and it speaks in the heart of every person! I would like to see more correlations related to the topic of each anthem. And maybe pick the 2-3 more interesting ones in terms of the findings and explain more about the historical events that are connected to the lyrics so as to give it a serious journalistic spin.

peterjost commented 7 years ago

Update

Content

The most frequent words of the national anthem of each european country as a word cloud:

europe_wordcloud

bildschirmfoto 2017-07-24 um 16 12 54

bildschirmfoto 2017-07-24 um 16 03 29

Any changes in direction or topic?

No major changes.

Problems/Questions

Dataset is incomplete (only the neighbour countries of Switzerland yet). I have to fix it as fast as possible. Visuals are still basic.

Checklist

peterjost commented 7 years ago

Update

Content

image

Any changes in direction or topic?

No sound, no videos.

Problems/Questions

Visuals are still basic.

Checklist

peterjost commented 7 years ago

Project visuals/text

image

Details

Headline: The Sound of blood and patriotism

Published website version: https://peterjost.github.io/studio-projects/nationalanthems/

Code repository: https://github.com/peterjost/studio-projects

Final data set(s): https://github.com/peterjost/studio-projects/blob/master/code/National_Anthems/National_Anthems.csv

https://github.com/peterjost/studio-projects/blob/master/code/National_Anthems/national_anthems.ipynb

What did you find to be the most difficult part of this project?

It was difficult coming out of the starting blocks, to find the right pace through the project schedule and to get comfortable with the project-templates. And I had to create the dataset by my own, collect the date from different sources.

Are you satisfied with what you produced? Is there anything you would like to change or improve?

It is a first step - not more, not less. But there are many things to improve. I have no problem to find a story line. Handling the data, graph them and made the graphs nicer and more accurate with the best matching tool are the topics I have to work on.

Checklist

sarahslo commented 7 years ago

Hi Peter, interesting topic. I've never seen this done before. First, yes, I agree on not liking word clouds. The problem with them is that if the word Tree and the word Constantinople are used at the same frequency, guess which one is more prominent....?!

Since you already have a good start using words-as-data let me suggest taking it a step further. With the age chart, I was going to suggest you flip the axis. We generally thinking of timelines as horizontal. But the how do you do the labeling if the dots stack up? What if, you used the names as the data, but like the word cloud but instead of a random arrangement (which is another reason I'm not a fan of word clouds) you simply stacked them up along the appropriate years?

You'd have something like

screen shot 2017-07-29 at 17 18 15

Also note here that I've arranged the countries alphabetically. I don't believe you have them in any order by year, so that is a question you should ask yourself whenever you have data: what order should they be in?

And what if you took that same idea and applied it to the count of the words in the lyrics. The words are the data. That would look something like this.

screen shot 2017-07-29 at 17 28 27

These are suggestions that are a little unconventional. It is important to get comfortable with the primary charting language first -- but, I thought I'd share this with you to keep in your back pocket for the future for next time you work with language (so you can skip word clouds!)

:)

'

thisss commented 7 years ago

That looks great, Pesche. If you have a lot of spare time, you might also dive into the music. There are libraries like Music 21. Maybe it would be possible to analyze the MIDI files of anthems. I never tried it, though.