TeamHG-Memex / html-text

Extract text from HTML
MIT License
130 stars 24 forks source link

button values? #3

Open kmike opened 7 years ago

kmike commented 7 years ago

Currently html_text doesn't extract text from button values (e.g. <input type="submit" value="Send"> ). It may be nice to have them extracted, at least optionally - they may be helpful as features for ML algorithms, and they look like text. But I don't know how important is this feature :)

kmike commented 7 years ago

End users can still extract these values explicitly, without using html_text.