newtfire / introDH-Hub

shared repo for DIGIT 100: Introduction to Digital Humanities class at Penn State Erie, The Behrend College
https://newtfire.github.io/introDH-Hub/
Creative Commons Zero v1.0 Universal
Mystery Text Discussion: pcc.txt #98

Open ebeshero opened 5 months ago

ebeshero commented 5 months ago

Post your screenshots and discuss your findings about pcc.txt here!

ssp5426 commented 5 months ago

[Screenshots: 0001, 0002, 0003, 0004]

Voyant Tools- I was surprised by what I found when I analyzed pcc.txt in Voyant Tools. Exploring it showed how unique the structure of the whole text is, given how much text the file holds. I was impressed that Voyant displays the number of unique word forms instead of only the total number of words in the file, which was very useful when I was analyzing how often the common words appear in the lines of text. I was also interested in how Voyant highlights words, which made it easier for me to follow along while reading the text.
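For anyone curious how the total word count differs from the unique word form count, here is a minimal Python sketch. It is not how Voyant computes its summary: the tokenizing rule (lowercase, letters and apostrophes only), the function name `word_stats`, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter

def word_stats(text):
    """Return (total word count, unique word form count, Counter of forms)."""
    # Assumed tokenizer: lowercase, keep runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    return len(tokens), len(counts), counts

# Made-up sample text, not taken from pcc.txt.
sample = "The cat sat on the mat. The mat was flat."
total, unique, counts = word_stats(sample)
print(total, unique)            # 10 7
print(counts.most_common(2))    # [('the', 3), ('mat', 2)]
```

To try it on the real file, read pcc.txt into a string and pass that to `word_stats` in place of `sample`; the numbers may not match Voyant exactly because the tokenizing rules differ.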

AntConc- It was fascinating that AntConc shows the distinct word structures rather than just the occurrences of the words in the document. This was especially helpful when I was breaking down the n-grams; I set the n-gram size to 4 so that my analysis stayed consistent. I was interested in the general features of the words that AntConc displays, which made the text simpler to track while reading. I was also surprised by what I discovered when examining pcc.txt in AntConc: the text is not only structured, but AntConc follows that structure to pull out the important key terms used in a paragraph, so it can be analyzed more consistently.
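As a rough illustration of what counting n-grams of size 4 means, here is a short Python sketch. It is not AntConc's implementation: the tokenizer, the function name `ngram_counts`, and the sample phrase are assumptions, and AntConc's own token definition may give different results.

```python
import re
from collections import Counter

def ngram_counts(text, n=4):
    """Count every run of n consecutive words in the text."""
    # Assumed tokenizer: lowercase, keep runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    # Slide a window of n words across the token list.
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(gram) for gram in grams)

# Made-up sample phrase, not taken from pcc.txt.
phrase = "to be or not to be or not to be"
four_grams = ngram_counts(phrase, n=4)
for gram, freq in four_grams.most_common():
    print(freq, gram)
```

Keeping `n` fixed at 4 for every run, as in the post above, means the frequency lists stay comparable from one pass to the next.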