newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal

Mystery Text discussion of pcc.txt #78

Closed: ebeshero closed this 11 months ago

ebeshero commented 1 year ago

Post your screenshots and discuss your findings about pcc.txt here!

ebeshero commented 1 year ago

[Image: newt-mosaic4]

mew-II commented 1 year ago

pcc.txt was certainly an interesting story to look at through AntConc's and Voyant's eyes. Each offered a different perspective. AntConc looked at repeated words and phrases, which led to some interesting out-of-context statements like the ones seen in this screenshot: [Screenshot: deathByViolence] My favorite of these is definitely "thrown into the water immediately after death by violence." I love how it contrasts with the two that follow about a nice garden, and somehow it also works with the repeated "ugh(s)" that precede it.

Voyant seems almost critical of the text, pointing out its severe lack of vocabulary variety. "Said" and "say" are both among the top five words used. [Screenshot: lackOfVocabulary] I assume that most people will also post this, but I thought I'd include it regardless, as it shows off the vocab usage pretty effectively. [Screenshot: words] The two perspectives together leave a lot for me to fill in on my own. Given how often "say" and "said" appear, I expect the text to focus heavily on dialogue, and based on the content I assume some sort of murder or battle occurs at some point in the story. The author uses very basic dialogue, which makes most of the low n-gram word clouds look like an English teacher's "Don't use these phrases/words" poster. Finally, I also noticed that phrases, even ones up to 10 words long, were repeated 3-4 times. I have no idea why this would be.
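For anyone who wants to check the vocabulary numbers outside the tools, here is a minimal Python sketch of the two measures discussed above: the ratio of unique words to total words (a type-token ratio, which is roughly what a "vocabulary density" figure reports) and a top-words list. The function names and the sample sentence are made up for illustration, and the tokenizer is a simple assumption that will not match AntConc's or Voyant's own tokenization exactly.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and pull out runs of letters/apostrophes as words."""
    return re.findall(r"[a-z']+", text.lower())


def vocabulary_density(tokens):
    """Unique words divided by total words (type-token ratio)."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def top_words(tokens, n=5, stopwords=frozenset()):
    """Return the n most frequent words, skipping any stopwords given."""
    return Counter(t for t in tokens if t not in stopwords).most_common(n)


# Invented sample text, NOT taken from pcc.txt.
sample = 'He said go. She said no. "Say it," he said.'
tokens = tokenize(sample)

print(vocabulary_density(tokens))  # lower values mean more repetition
print(top_words(tokens, 2))
```

To try it on the real file, something like `tokenize(open("pcc.txt", encoding="utf-8").read())` should work, assuming the file sits in the working directory and is UTF-8 encoded.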

ceq5032 commented 1 year ago

[Screenshots: 2023-03-16 at 12 32 48 PM; 2023-03-16 at 12 30 52 PM; 2023-03-16 at 12 48 38 PM; 2023-03-16 at 12 47 31 PM; 2023-03-16 at 1 08 09 PM]

sammoniot commented 1 year ago

One thing I found intriguing about the text was that the word "great" was used less and less often as the document went on. This makes me think that things were possibly getting worse over time, or "less great." [Screenshot (14)]
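A trend like this can be approximated by cutting the text into equal-sized segments and counting the word in each one. The sketch below is one way to do that in Python. The `segment_counts` name, the choice of four segments, and the sample text are all assumptions for illustration, and the segmenting may differ from how a trends graph in a tool bins the document.

```python
import re


def segment_counts(text, word, segments=10):
    """Split the text into `segments` roughly equal runs of words and
    count how many times `word` occurs in each run."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = len(tokens)
    counts = []
    for k in range(segments):
        # Integer boundaries so every token lands in exactly one segment.
        start = k * total // segments
        end = (k + 1) * total // segments
        counts.append(tokens[start:end].count(word.lower()))
    return counts


# Invented sample text, NOT taken from pcc.txt.
sample = (
    "great great great day "
    "great great calm night "
    "great dull grey dawn "
    "quiet cold dark end"
)
print(segment_counts(sample, "great", segments=4))
```

A falling list of counts shows the word thinning out, though by itself it does not say why. Reading the passages in each segment is still needed to judge whether things really get "less great."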

In addition, as the n-gram size increased, the frequency decreased.

[Screenshot (12)] [Screenshot (13)]
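The drop in frequency for longer n-grams is expected: every occurrence of a longer phrase contains an occurrence of the shorter phrase it starts with, so the top count can only stay the same or fall as n grows. Here is a minimal Python sketch that shows this on a made-up sentence. The `ngram_counts` helper and the sample are assumptions for illustration, not how AntConc computes its n-gram lists.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count every run of n consecutive words in the token list."""
    return Counter(
        tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
    )


# Invented sample text, NOT taken from pcc.txt.
sample = "time out of mind and time out of mind again and time out of reach"
tokens = sample.split()

top_counts = []
for n in range(1, 6):
    gram, count = ngram_counts(tokens, n).most_common(1)[0]
    top_counts.append(count)
    print(n, " ".join(gram), count)
```

The same helper with `n=10` would list any ten-word phrases that repeat, which is one way to look for long repeated passages in a text.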

Another part of the document that I found interesting was that the phrase "time out of mind" appeared nine times. That phrase has not been used very often in recent years. The only recent use I could think of is the 1997 Bob Dylan album with the same title.

[Screenshot (15)]

Another word that is not used very often today, "thence," appeared twenty-eight times, which further suggests that the text was probably written a long time ago. [Screenshot (16)] Looking through the document and analyzing its text has reminded me how much the words we use can change over a short period of time.