jreinhardt / readable.js

A javascript library for readability analysis

Dale Chall Readability Formula #5

Open jreinhardt opened 11 years ago

jreinhardt commented 11 years ago

The word list was already added in f57ca8c79406833a33ab8cce1e737272344eec20

http://en.wikipedia.org/wiki/Dale-Chall_Readability_Formula
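For reference, a minimal sketch of the score itself, using the constants from the formula as commonly stated: 0.1579 times the percentage of difficult words, plus 0.0496 times the average sentence length, plus 3.6365 if more than 5% of the words are difficult. The names `daleChallScore` and `isFamiliar` are placeholders, not existing readable.js API, and the tokenizer and sentence splitter are deliberately naive.

```javascript
// Sketch only: `isFamiliar` is a hypothetical callback that reports whether
// a word is on the Dale-Chall list. It is not part of readable.js.
function daleChallScore(text, isFamiliar) {
  // Naive sentence split: any run of '.', '!' or '?' ends a sentence.
  const sentences = text.split(/[.!?]+/).filter((s) => s.trim().length > 0);
  // Naive tokenizer: lowercase letter runs, with an optional inner apostrophe.
  const words = text.toLowerCase().match(/[a-z]+(?:'[a-z]+)?/g) || [];
  if (words.length === 0 || sentences.length === 0) return 0;

  const difficult = words.filter((w) => !isFamiliar(w)).length;
  const pctDifficult = (difficult / words.length) * 100;
  const avgSentenceLength = words.length / sentences.length;

  let score = 0.1579 * pctDifficult + 0.0496 * avgSentenceLength;
  // Adjustment applied when more than 5% of the words are difficult.
  if (pctDifficult > 5) score += 3.6365;
  return score;
}
```

Passing the lookup in as a callback keeps the scoring independent of how the word list is stored (plain `Set`, Bloom filter, or something else).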

Implement it using a Bloom filter:

http://www.jasondavies.com/bloomfilter/
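To make the idea concrete, here is a self-contained sketch of a Bloom filter for word lookups. It does not use the library linked above; the class, the FNV-1a-style hash, and the sizes are illustrative assumptions. As a rule of thumb, about 10 bits per word with 7 hash functions gives a false positive rate of roughly 1%.

```javascript
// Minimal Bloom filter sketch (hypothetical, not the linked library's API).
class BloomFilter {
  constructor(bits, hashes) {
    this.bits = bits;       // total number of bits in the filter
    this.hashes = hashes;   // number of bit positions set per word
    this.buckets = new Uint8Array(Math.ceil(bits / 8));
  }

  // FNV-1a-style 32-bit hash over UTF-16 code units; the seed is XORed
  // into the offset basis to derive a second, different hash.
  static hash(str, seed) {
    let h = (0x811c9dc5 ^ seed) >>> 0;
    for (let i = 0; i < str.length; i++) {
      h ^= str.charCodeAt(i);
      h = Math.imul(h, 0x01000193) >>> 0;
    }
    return h;
  }

  // Double hashing: position_i = (a + i * b) mod bits.
  positions(word) {
    const a = BloomFilter.hash(word, 0);
    const b = (BloomFilter.hash(word, 0x9e3779b9) | 1) >>> 0; // force odd step
    const out = [];
    for (let i = 0; i < this.hashes; i++) {
      out.push((a + i * b) % this.bits);
    }
    return out;
  }

  add(word) {
    for (const p of this.positions(word)) {
      this.buckets[p >> 3] |= 1 << (p & 7);
    }
  }

  // true means "possibly in the set", false means "definitely not".
  test(word) {
    return this.positions(word).every(
      (p) => (this.buckets[p >> 3] & (1 << (p & 7))) !== 0
    );
  }
}

// Usage with a tiny stand-in for the real word list.
const familiar = new BloomFilter(1024, 7);
["the", "cat", "dog", "house"].forEach((w) => familiar.add(w));
```

Note the direction of the error: a Bloom filter can report an unlisted word as familiar, but it never reports a listed word as missing, so it can only lower the count of difficult words slightly.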

jreinhardt commented 11 years ago

We probably need to apply a stemmer to both the word list and the text; otherwise the number of false negatives (inflected forms of listed words that are not found in the list) is rather large.
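A rough sketch of what that could look like. `naiveStem` is a deliberately crude suffix stripper written for illustration, not the Porter algorithm and not an existing readable.js function; a real stemmer would replace it. The point is only that the same function is applied to both the list and the text.

```javascript
// Crude suffix stripper for illustration only (NOT a real stemmer).
function naiveStem(word) {
  const w = word.toLowerCase();
  const suffixes = ["ing", "ed", "es", "s"];
  for (const suffix of suffixes) {
    // Keep at least three characters so short words like "bus" survive.
    if (w.endsWith(suffix) && w.length - suffix.length >= 3) {
      return w.slice(0, -suffix.length);
    }
  }
  return w;
}

// Stem the word list once, then stem each text word before looking it up.
function buildStemmedLookup(wordList) {
  const stems = new Set(wordList.map(naiveStem));
  return (word) => stems.has(naiveStem(word));
}

const isFamiliarStem = buildStemmedLookup(["walk", "box"]);
isFamiliarStem("walking"); // true: "walking" and "walk" share the stem "walk"
```

This toy version still misses pairs such as "house" and "houses" (the stems come out as "house" and "hous"), which is exactly the kind of case a proper stemming algorithm is meant to handle.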

Is it possible to detect British English? A first test on British English text gave quite wrong results.