cdimascio / essence

Automatically extract the main text content (and more) from an HTML document
Apache License 2.0
116 stars 16 forks source link