cmda-tt / course-17-18

🎓 Frontend 3 · 2017-2018 · Curriculum and Syllabus 📊
https://cmda-tt.github.io/course-17-18/
Other
39 stars 64 forks source link

Assessment 3 data: @krsty #542

Closed krsty closed 6 years ago

krsty commented 6 years ago

For the third assessment I am going to use data of curse words used in Tarantino films:

movie type word minutes_in  
Reservoir Dogs word dick 0.40  
Reservoir Dogs word dicks 0.43  
Reservoir Dogs word fucked 0.55  
Reservoir Dogs word fucking 0.61
Reservoir Dogs word bullshit 0.61
Reservoir Dogs word fuck 0.66
Reservoir Dogs word shit 0.90  
Reservoir Dogs word fuck 1.43  
Reservoir Dogs word dicks 1.56  
Reservoir Dogs word fuck 1.66  

Contents

I'm using this dataset because it has a lot of data that can be used in different ways. This way I can make multiple visualisations. I also find it interesting because I love Tarantino films.

Visualisation ideas

The dataset has timestamps of when the words are said, so I can make a visualisation with a timeline. I can also try and filter out a certain word, and make it apparent in which movie that word was said the most. Maybe there are several trends to be discovered in Tarantino's use of bad words, atleast that's what I am hoping to find.

Type of visualisation

For the filtering of words I am thinking of using a line chart. With a line chart it should be easy to see which words were said the most in which movie. I could maybe also use a donut or pie chart for this.

For the timeline i'm thinking of using a scatter plot, because you can show more than 3 variables, time, movie and what word was said.

Razpudding commented 6 years ago

Cool data, pretty poor description of the concept. Curious to see what you end up making :)