newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal
8 stars 4 forks source link

Voyant Tools Intro Exercise #31

Closed ebeshero closed 3 years ago

ebeshero commented 3 years ago

Experiment with Voyant Tools on any text you wish, perhaps readings from our class, or from sites of interest to you.

nmiano commented 3 years ago

The Text I surveyed was the illustration "Percy Jackson and The Lightning Thief" by Rick Riordan

https://voyant-tools.org/?corpus=543c5ea4822a178b51c174426d9ae70c

Comment: What's interesting about the text is that it didn't reveal as much words regarding the Greek gods than I expected. What also surprised me was that "Grover" was mentioned more than "Annabeth." I really expected to see "said" to be the biggest word in this text because it's a very conversation driven text.

ebeshero commented 3 years ago

@nmiano See if you can take a screen capture and save locally and then see if you can insert the image in your comment! I think that's what we did last fall.

jbg5721 commented 3 years ago

Link to my surveyed website: https://climate.nasa.gov/effects/ I chose this site just because it is a topic I am interested in.

Comment: What I find interesting is about this graph is that there are not many "common words." I expected to see at least a few common words, such as: 'the.' I am not surprised to see climate, change, heat, sea, ice, heat, all over the place

VoyantSS_JG
amayadwillis commented 3 years ago

I used the text from The Scarlet Letter, I found it on The Project Gutenberg website here: https://www.gutenberg.org/files/25344/25344-h/25344-h.htm Capture1 Capture

The main thing I was interested in looking at for this book was the most used words. I was a little surprised to see that "letter" and "scarlet" weren't even in the top ten. The top five most used words are "Hester," "Little," "Pearl," "Old," and "Child." I think it goes to show that although the story is called "The Scarlet Letter" the book isn't about Hester's shame and punishment, its actually about her and her love for her child.

ras6164 commented 3 years ago

I analyzed a research article on Facebook advertising which can be read here: http://designdamage.com/wp-content/uploads/2009/04/Understanding-the-Value-of-a-Social-Media-Impression.pdf.

download (2)

For most of the words, I expected to see them as some of the top words used in the text (impressions, ad, organic, etc). However, I was shocked to see homepage and exposed as while I remember some of that in the reading, it was not a key takeaway that stuck with me afterwards. In addition to this, I found it interesting that social was used a lot more often than media. I would have thought that those words would not have such a wide range of use as it is a phrase that is commonly used together in the text.

erinmooney commented 3 years ago

I used Plant Lore, Legends, and Lyrics by Richard Folkard from Project Gutenberg. (https://www.gutenberg.org/ebooks/44638) The outcome was :

PlantLoreWordBubble

For text about Plant Lore, the Word Bubble words are about what I expected! "Called" was a bit of a surprise, but I guess it makes sense because the text needs to identify different plants. "According" was also not expected, but from this word I can extrapolate that this text is more like a historical viewpoint of lore instead of a cultural one. Like, instead of telling the story of the plant, it is more like, "according to sources, this plant..." I really like this Bubble, and how the words really suit their source!

kjc6167 commented 3 years ago

I chose an article on NASA's Perseverance Mars rover. (https://mars.nasa.gov/news/8875/testing-proves-its-worth-with-successful-mars-parachute-deployment/)

image image

I thought it was interesting to look at which words were most commonly used throughout the article. Since the title was "Testing Proves Its Worth With Successful Mars Parachute Deployment," I assumed that "testing", "Mars", and "parachute" would be the most common words. Although "Mars" was the most used word, "rover", "perseverance", and "mission" were used much more than the previously mentioned list. However, I was not surprised that these words were very common in this article.

chrisklein10 commented 3 years ago

Link to surveyed website: https://liveforlivemusic.com/news/john-lennon-gimme-some-truth-deluxe-box-set/

Something interesting to me is the words of social media on here because to think he died a way long time before social media is a thing and that he is still so relevant that there is an account under his name is amazing. Patterns of the words don't really surprise me except there is one word, which is jazz that shocks me a bit.

Screenshot (58)

Jakeruff29 commented 3 years ago

I read a biography on Dale Earnhardt, one of NASCARs best drivers.
download Something interesting I found is that even if you had no idea who Dale Earnhardt was, you would be able to gather much information just from the words shown; such that "championship" or "championships" were displayed 17 times, and the word "win" was displayed 9 times. Something else interesting that I had found was that "Dale" was referred to 14 times and "Earnhardt" was referred to 26 times, almost double. Also interesting is that "Sr." or "senior" did not make the commonly used words list even though Dale Earnhardt can refer to Dale Earnhardt Jr. as well.

luh429 commented 3 years ago

I used the Lorax as my text. Here is the link to the pdf I used: http://dep.wv.gov/WWE/Programs/nonptsource/Documents/TheLorax.pdf I wasn't very surprised that words like Lorax, trees, and truffula were some of the most common words. However I found it odd that the word "ler" as in "Once-ler" was used quite frequently but the word "once" wasn't even in the top 55 words. I also thought it was comical that most of the most common words were made up words. For example Lorax, truffula, ler, thneed, biggering, and biggered. Lorax

ebeshero commented 3 years ago

@luh429 That's an interesting discovery you've made! I think Once- and -ler got cut in half by the - (hyphen) in between them. I wonder if Voyant is keeping only the second half of a hyphenated word?

Awebster7 commented 3 years ago
Screen Shot 2021-03-03 at 10 54 16 PM
debbie-evans commented 3 years ago

After trying to think of something serious to look at over the past day or so, I decided to put the Bee Movie script into Voyant (https://web.njit.edu/~cm395/theBeeMovieScript/)

Bee Movie

I can't say that anything in the word frequency really surprised me, the top words that aren't ones that are commonly used in dialogue (I'm, it's, just, etc.) were exactly what I expected (bee, bees, honey, etc.). One thing that I did find interesting is that the word "thinking" appears in the script twenty-seven times, but the reason that number is so high is because the phrase "thinking bee" is used eleven times in quick succession.

aidanvray commented 3 years ago

I decided to analyze Donald Trump's most recent speech from this year's CPAC convention. I got the transcript from https://www.rev.com/blog/transcripts/donald-trump-cpac-2021-speech-transcript.

trump

Due to the way the transcript was formatted, it stated Trump's name along with timestamps above every line he said. That's why his first and last name are the most repeated things from the transcript. It's also why "01" was repeated a few times. Aside from that a lot of phrases related to his most recent talking points were repeated a fair amount. Some of these include election, won, Biden, China, and Democrats. The only thing that really surprised me was that rigged and fraud were barely used in the speech.

WillStill commented 3 years ago

I used the text from a short story called Safari by Jennifer Egan. https://www.newyorker.com/magazine/2010/01/11/safari-3 TheSafariVoyantTools I noticed that the names of characters were the most prevalent, which should be expected. The next most prevalent words were names of relationships between the characters. "father," "children," and "group," are the few that show. After that, a lot of the words found were of no significant importance. Some of the most prevalent words were "it's," "she'll," "like," "he's," and "she's."

thammer12499 commented 3 years ago

index The subject of the doc was a research paper into the effects of blue light on the human eye and its potential effects on the brain. I'm not surprised by some of the words to come up so frequently but rather I'm surprised by the number of filler words used frequently within the document. It's not something I often notice even reading aloud.

SoulEpicness commented 3 years ago

I Decided to look at speed runs and wanted to look at a guide for Dark Souls. Link: Dark Souls speedrunning Dark Souls words

I thought that in the text I would see more terms that were based on movement tactics and shortcuts. I instead saw Dark, Souls, and speedsouls more than I thought I would have, but after seeing them I saw words I thought I might see such as stamina, menuing, and route.

TOMZS1 commented 3 years ago

The text that I used is from "Alice in Wonderland", by Lewis Carrol. This is one of my favorite books and I thought it would be interesting to see the most frequent words in this book.

Alice in Wonderland

Alice in Wonderland

I like how you can tell what the story's about. Its like the main things that people know about the story are the top words used in this book.

Tonya-Velcko commented 3 years ago

The text I used was Shakespeare's Macbeth from this link: http://shakespeare.mit.edu/macbeth/full.html voyant image I expected the characters' names to be common, but there were some words that were a surprise, like witch, sleep, and exeunt.