newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal
8 stars 4 forks source link

Mystery text discussion of dc.txt #67

Closed ebeshero closed 1 year ago

ebeshero commented 1 year ago

Post your screenshots and discuss your findings about dc.txt here!

JordanJ7 commented 1 year ago

I have noticed in ANTCONC with no restrictions to the ngram that there is an extreme amount of phrases such as "van Helsing" (322), "the professor"(150), "the door" (136). The most frequent phrase was "of the" (865). The KWIC phrase that I searched was "Van Helsing", from what appears to be a name that needs to be summoned such as summoning a demon or saying just an old falsified way of calling someone to another room. From what I can tell, this story has taken place a long time, and there is a doctor helping people.

Here are some exciting Finds

Screenshot 2022-10-17 210857

This shows one of the phrases that piqued my interest among all the other phrases and turns out to be just a name.

image

Here is a photo that shows what the most common words are with no restrictions.

epp5198 commented 1 year ago

antconc1 antconc2 While looking at this mystery text through AntConc I searched for an N-Gram of 3. The most common terms were "i could see"(73), "i could not" (68), and "dr van helsing" (67). While looking at "i could see" in KWIC it showed many interactions between the main character of the story and others signifying that the character is some sort of professional from the surrounding dialogue.