newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal
8 stars 4 forks source link

Mystery Text Discussion: pcc.txt #86

Closed ebeshero closed 7 months ago

ebeshero commented 10 months ago

Post your screenshots and discuss your findings about pcc.txt here!

Matthew-W8 commented 10 months ago

The information I have gathered would lead me to believe that this mystery text is some kind of story. The first clue for me was the frequency of the word 'said' in the mystery text, which lead me to believe that there was some form of dialogue in the text. Said could also be used in news article, but I don't think it would be the most repeated word. said-word-cirrus It appears the most out of any words present in the story, being used 661 times. The next clue was in AntConc's N-Grams. At an N-Gram count of 5, the word 'chapters' appeared continually. strange-ngrams At N-Grams of 6, characters started to appear. Their names also give clues to where the story could originate or where it takes place as they appear to be French, and one of the in context views mentions a Rue Morgue and Rue is the French word for street. text-names Other details can be assumed through the word cirrus. It's likely that time plays it to the plot some how due to the frequency it appears. Little also appears quite frequently, but viewing it in context it appears to just be adjective the author like to do. The TermsBerry also doesn't reveal to much. Most connections are only one or two strong so it doesn't really lead to any assumptions. I haven't been able to decipher much of the plot, but I believe I have figured out the setting and that it is a story.

I also just found this N-Gram very interesting. It doesn't really mean much, I just thought it was worth mentioning. cough