standardebooks / web

The source code for the Standard Ebooks website.
https://standardebooks.org
Creative Commons Zero v1.0 Universal

Should reading time also be based on text complexity? #40

Closed robinwhittleton closed 4 years ago

robinwhittleton commented 5 years ago

At the moment it’s just number of words with a scaling factor.

acabal commented 5 years ago

I don't know if there's a way to calculate that. IIRC right now it's just based on the average adult WPM reading speed.
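
For illustration, a word-count-based estimate of this kind can be sketched as below. This is a minimal sketch, not the site's actual code: the function name `estimate_reading_minutes` and the `ASSUMED_WPM` value of 275 are assumptions made for the example, since the thread does not state the real constant.

```python
import math

# Assumed average adult reading speed in words per minute.
# The constant actually used by the site is not given in this thread.
ASSUMED_WPM = 275


def estimate_reading_minutes(word_count: int, wpm: int = ASSUMED_WPM) -> int:
    """Estimate reading time in whole minutes, rounded up."""
    if word_count < 0:
        raise ValueError("word_count must be non-negative")
    if wpm <= 0:
        raise ValueError("wpm must be positive")
    # Reading time is simply words divided by reading speed.
    return math.ceil(word_count / wpm)
```

Because the estimate depends only on `word_count`, two books of the same length get the same reading time no matter how dense the prose is, which is the limitation this issue raises.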

acabal commented 5 years ago

If you can find some kind of accepted way of calculating it, like Flesch-Kincaid calculates reading ease, then we can certainly update the algorithm.
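
For reference, the Flesch reading-ease score is computed from average sentence length and average syllables per word. The sketch below uses the standard published coefficients; the helper names and the vowel-group syllable counter are assumptions for illustration only (real syllable counting is more involved), and nothing here says how a score should map to reading time.

```python
import re


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch reading ease: higher scores mean easier text."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def crude_counts(text: str) -> tuple[int, int, int]:
    """Return (words, sentences, syllables) using rough heuristics.

    Assumption: a syllable is approximated as a run of vowels (a, e, i, o,
    u, y), with at least one syllable per word. This miscounts many English
    words and is only meant to make the example self-contained.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(
        max(1, len(re.findall(r"[aeiouy]+", w.lower()))) for w in words
    )
    return len(words), len(sentences), syllables
```

For example, `flesch_reading_ease(*crude_counts(text))` scores a passage of plain text, given the crude heuristics above.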

robinwhittleton commented 5 years ago

This Readability plugin calculates reading time based on Flesch-Kincaid and the average time of “reading for comprehension”: https://github.com/mikestecker/craft-readability#measuring-how-long-it-should-take-to-read. The Wikipedia article they took their data from seems to be unsourced, though. If we take some sample books of around the same length from SE and run them through that formula, the data changes like so:

That feels more realistic than what we currently have, but it’s not exactly scientific, or at least there are no referenced papers to back it up. I’ll carry on looking.

robinwhittleton commented 5 years ago

We should probably add “Approximately” in front of the time as well, regardless of whatever solution we go with.

acabal commented 5 years ago

Meh, that method is just as fuzzy as ours. If we're going to change this then it should be based on some more widely accepted standard, not what some random plugin decided to do.

acabal commented 4 years ago

Closed due to inactivity

digitalethics commented 4 years ago

Linking to related Foliate issue to possibly deduplicate efforts.