Open thesamovar opened 3 years ago
Probably not useful here but I have used Mozilla's readability.js, which is a library for heuristic analysis of HTML. It powers the "Reader" view in Firefox.
Not sure if it will be useful or not at this stage, but good to know that it exists!
https://mozilla.github.io/pdf.js/