issues
search
DonaldTsang
/
stylo
Stylometry in Python - Simplified
0
stars
0
forks
source link
How variable may a constant be? Measures of lexical richness in perspective
#1
Open
DonaldTsang
opened
4 years ago
DonaldTsang
commented
4 years ago
[ ] alpha-chars-ratio the fraction of total characters in the paragraph which are letters
[ ] digit-chars-ratio the fraction of total characters in the paragraph which are digits
[ ] upper-chars-ratio the fraction of total characters in the paragraph which are upper-case
[ ] white-chars-ratio the fraction of total characters in the paragraph which are whitespace characters
[ ] type-token-ratio ratio between the size of the vocabulary (i.e., the number of different words) and the total number of words
[ ] hapax-legomena the number of words occurring once
[ ] hapax-dislegomena the number of words occurring twice
[ ] yules-k a vocabulary richness measure defined by Yule
[ ] simpsons-d a vocabulary richness measure defined by Simpson
[ ] brunets-w a vocabulary richness measure defined by Brunet
[ ] sichels-s a vocabulary richness measure defined by Sichel
[ ] honores-h a vocabulary richness measure defined by Honore
[ ] average-word-length average length of words in characters
[ ] average-sentence-char-length average length of sentences in characters
[ ] average-sentence-word-length avarage length of sentences in words