Average word length
Vocabulary richness (e.g., type-token ratio)
Frequency distribution of words, bigrams, and trigrams
Frequency of stop words
Frequency of rare words or unique terms
Syntactic Features:
Average sentence length
Sentence complexity (e.g., number of clauses per sentence)
Frequency of different part-of-speech (POS) tags
Dependency parsing patterns
Stylistic Features:
Frequency of punctuation marks
Use of passive voice
Readability scores
Lexical Features:
Average word length Vocabulary richness (e.g., type-token ratio) Frequency distribution of words, bigrams, and trigrams Frequency of stop words Frequency of rare words or unique terms
Syntactic Features:
Average sentence length Sentence complexity (e.g., number of clauses per sentence) Frequency of different part-of-speech (POS) tags Dependency parsing patterns Stylistic Features:
Frequency of punctuation marks Use of passive voice Readability scores
Semantic Features:
Sentiment analysis scores