newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal
10 stars 4 forks source link

Mystery Text Discussion: web.txt #107

Open ebeshero opened 2 weeks ago

ebeshero commented 2 weeks ago

Post your screenshots and discuss your findings about web.txt here!

GabVoz13 commented 1 week ago

For this assignment, I chose to analyze the content of the "www.txt" file using AntConc. As I experimented with the software, I observed that many of the frequently repeated words in the text were "fillers," such as "and," "but," "for," and "a." These words dominated the search results in the NGRAM tab, with most appearing over 100 times. When I shifted to the KWIC (Key Word in Context) view, one word in particular stood out: "for." The KWIC search revealed that "for" commonly appeared in specific phrases, such as "for the most part" (11 times), "for the first time" (9 times), and "for a moment" (19 times). This insight highlighted for me how often certain phrases are used in everyday language, even if we may not consciously notice their frequency.

Screenshot 2024-11-04 at 3 00 15 PM Screenshot 2024-11-04 at 3 00 21 PM Screenshot 2024-11-04 at 3 08 24 PM Screenshot 2024-11-04 at 2 58 52 PM Screenshot 2024-11-04 at 2 59 04 PM