grangier / python-goose

HTML Content / Article Extractor, web scraping lib in Python
Apache License 2.0
3.98k stars 788 forks

h1,h2...h6 not returned #242

Open tamimibrahim opened 9 years ago

tamimibrahim commented 9 years ago

When I extract articles from any page, I notice that the text inside heading tags (h1, h2, ... h6) is not returned in cleaned_text.

Is that normal for everyone, or have I missed something?

mircealungu commented 8 years ago

This would indeed be a nice feature. I don't think it's part of the current version of the project.
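Until something like this is supported, one possible workaround is to pull the headings out of the page's HTML yourself. The sketch below is not part of goose: `HeadingCollector` and `extract_headings` are hypothetical helper names, and it uses only the Python 3 standard library. It assumes you already have the page's HTML as a string (for example from the extracted article's raw HTML, if your version exposes it, or from your own download step).

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingCollector(HTMLParser):
    """Hypothetical helper: collects (tag, text) pairs for h1..h6 elements."""

    def __init__(self):
        super().__init__()
        self.headings = []    # results, in document order
        self._current = None  # heading tag currently open, if any
        self._parts = []      # text fragments seen inside that heading

    def handle_starttag(self, tag, attrs):
        # Start collecting when a heading opens (nested headings are ignored).
        if tag in HEADING_TAGS and self._current is None:
            self._current = tag
            self._parts = []

    def handle_data(self, data):
        # Keep text only while inside a heading, including inline children.
        if self._current is not None:
            self._parts.append(data)

    def handle_endtag(self, tag):
        # Close the heading and store its text with whitespace normalized.
        if tag == self._current:
            text = " ".join("".join(self._parts).split())
            self.headings.append((tag, text))
            self._current = None


def extract_headings(html):
    """Return a list of (tag, text) tuples for every heading in `html`."""
    parser = HeadingCollector()
    parser.feed(html)
    parser.close()
    return parser.headings


if __name__ == "__main__":
    sample = "<h1>Title</h1><p>Body text.</p><h2>Sub <em>part</em></h2>"
    for tag, text in extract_headings(sample):
        print(tag, text)
```

This only lists the headings. It does not merge them back into cleaned_text at the right positions, which would need changes inside the extractor itself.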