kupolak / textstat

Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
MIT License
31 stars 9 forks source link

CEFR levels #32

Open kupolak opened 4 years ago

kupolak commented 4 years ago

It would be nice to introduce a new calculation feature for CEFR levels.

scarroll32 commented 1 year ago

@kupolak I would be interested in contributing. Do you know of anywhere I can find a formula?

kupolak commented 1 year ago

@scarroll32 Hi! Sadly, I don't know any formula for CEFR levels. You can try to find lists of A1, A2, B1, C1, C2 words. I think for English it wouldn't be that difficult to find, I don't know about other languages. And based on that lists you can determine the % of word usage and its difficulty in a text.