Open ebeshero opened 1 month ago
A ngram size of 7 gives frequency counts as high a 9. While an ngram size of 6 gives frequency counts in the double digits, with one at 10. There are 46 phrases that occur between 5-10 times. Based off some of the phrases I believe this text is from Lazy Susan.
I found myself most interested in the longer n-grams with high frequency ratings. For instance, I found three n-gram strings over 8 that appeared a minimum of five times. As they all appear to reference name in specific orders, it seems to me that they refer to letters being sent within the context of the story. It is interesting to see such specific orders repeated this many times. Similarly, as seen above, there are four n-grams of 5 that appear over 25 plus (!!!) times in the text. Once more, this is incredibly interesting, as the phrases used above seem specific. Its hard to believe there would be over 25 contexts in which they could be used across the story segment. My findings overall found the same results to Jocelyn - with several n-grams above 6 or 7 giving results in the double digits, and several phrases used between 5 and 10 times. Hits of shorter n-gram lengths could be found in the hundreds, which isn't as surprising. Several noteworthy locations are mentioned. The British setting is obviously incredibly important.
These are my findings from Voyant and Antconc. I noticed the higher the n-gram size was the highest frequency was 5. When the n-gram size was lower, like 5, the frequency was in the double digits. Common phrases repeated were"a quarter of an hour", "I do not know what", and "in the course of the".
The words that were used the most were "of the" and "a quarter of an hour". When I changed the n-gram the frequency of words changed. I used the n-gram 3 and 5. The next two common phrases were "to be" and "i do not know what".
My findings are mostly from Antconc, as the Voyant tools was causing significant lag on my laptop. Some common phrases which appeared in Antconc were as follows: "Type mrs vernon to lady de courcy churchhill" which appeared 9 times, "Type adeiu laura letter th laura to marianne" which appeared 5 times, and "Type lady de courcy churchhill my dear mother" which appeared 5 times as well. These are size 7 N-grams, which i found to be the most revealing about what the text was about. I suspect this was a letter from a "Mrs. Vernon" to her mother, described as "Lady de Courcy Churchhill".
I noticed that mr was the most used word in voyant when I looked at mr in the KWIC I seen that the only name that shows up on the right context is Darcy so I am assuming that he is a important character in the story or that the story is about him.
Post your screenshots and discuss your findings about cac.txt here!