newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal

Mystery Text discussion of a Radio Play #76

Closed (ebeshero closed 11 months ago)

ebeshero commented 1 year ago

Post your screenshots and discuss your findings about one or a group of radio plays here!

ebeshero commented 1 year ago

(Reminder: Don't do ac.txt b/c we did that in class on Tuesday March 14, so it's no longer a big mystery.) This response is just here as a demonstration of a comment in GitHub Issues. I found this interesting pattern in which "laughter" is a super large word in the Voyant wordcloud (cirrus) view of ac.txt.

[Screenshot 2023-03-14 at 13 01 00]

cbl5678 commented 1 year ago

I did the text "generalmaxwelltaylor.txt." Ngrams 3 and 4 both produce frequency counts above 5. Ngram 3 produces some as high as 20. Ngram 3 is the only ngram that produces frequency values in the double digits. When you reach ngram 4, there are no longer any frequency values above 5. Some of the most frequent phrases are "do you think" and "general taylor i."
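For anyone who wants to double-check ngram counts like these outside of Voyant, here is a minimal Python sketch. It is only an illustration: the function name `ngram_counts` and the sample lines are made up (they are not quoted from generalmaxwelltaylor.txt), and the simple word-splitting rule is an assumption, so the numbers may not match Voyant's exactly.

```python
from collections import Counter
import re

def ngram_counts(text, n):
    """Count every run of n consecutive words in text (case-insensitive)."""
    # Assumption: a "word" is any run of letters, digits, or apostrophes.
    words = re.findall(r"[a-z0-9']+", text.lower())
    # Slide a window of n words across the list and join each window into a phrase.
    grams = (" ".join(words[i:i + n]) for i in range(len(words) - n + 1))
    return Counter(grams)

# Hypothetical sample in an interview layout, with a speaker label on each turn.
sample = (
    "INTERVIEWER: Do you think the plan worked? "
    "GENERAL TAYLOR: I do. "
    "INTERVIEWER: Do you think it was worth it? "
    "GENERAL TAYLOR: I think so."
)

trigrams = ngram_counts(sample, 3)
fourgrams = ngram_counts(sample, 4)

# Show the most frequent 3-word and 4-word phrases with their counts.
print(trigrams.most_common(5))
print(fourgrams.most_common(5))
```

Even in this tiny sample, the speaker labels create repeated phrases such as "general taylor i" that are not really part of what anyone said.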

This text is an interview with General Maxwell Taylor. The way the text is written, we get a lot of repeated words because it labels who is speaking each time, like in a play. The interview tends to talk a lot about armies and war. Without reading the text fully, I'm having trouble determining what war they are talking about and where exactly General Taylor is from.

[Screenshots: 3_ngram, most_used_words, do_you_think, 4_ngram]