dpalmasan / TRUNAJOD2.0

An easy-to-use library to extract indices from texts.
https://trunajod20.readthedocs.io/en/latest/
MIT License

Implement a Hapax Legomena Index #28

Open dpalmasan opened 3 years ago

dpalmasan commented 3 years ago

The hapax legomena index is defined as the number of distinct words that occur exactly once in the text. Since it is a raw count, it depends on text length, so it is probably most useful when comparing texts of similar length.
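A minimal sketch of how the count could be computed, to make the definition concrete. This is not existing TRUNAJOD code: the function name `hapax_legomena_index`, the lowercasing, and the regex-based tokenization are all assumptions made for illustration. A real implementation would presumably work on the tokens the library already produces.

```python
import re
from collections import Counter


def hapax_legomena_index(text: str) -> int:
    """Count the distinct words that occur exactly once in ``text``.

    Hypothetical helper, not part of the TRUNAJOD API.
    """
    # Assumption: words are runs of word characters, compared case-insensitively.
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(tokens)
    # A hapax legomenon is a word whose frequency in the text is exactly 1.
    return sum(1 for freq in counts.values() if freq == 1)


sample = "the cat sat on the mat while the dog slept"
print(hapax_legomena_index(sample))  # 7: every word except "the"
```

Whether to count surface forms or lemmas, and whether to drop punctuation and stop words first, is an open design choice that changes the result.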