Currently html_text doesn't extract text from button values (e.g. <input type="submit" value="Send">). It might be nice to have them extracted, at least optionally: they may be helpful as features for ML algorithms, and they look like text to the user. But I don't know how important this feature is :)
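
To make the idea concrete, here is a rough sketch of what collecting button values could look like. It uses only the standard library's html.parser and is not html_text's API or internals; the names `ButtonValueParser`, `extract_button_values` and `BUTTON_TYPES` are made up for illustration, and the choice of which input types count as "buttons" is an assumption.

```python
from html.parser import HTMLParser

# Assumption: these are the input types whose "value" is shown as a button label.
BUTTON_TYPES = {"submit", "button", "reset"}


class ButtonValueParser(HTMLParser):
    """Collect the visible labels of <input> buttons (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.values = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "input":
            return
        attrs = dict(attrs)
        input_type = (attrs.get("type") or "").lower()
        value = (attrs.get("value") or "").strip()
        if input_type in BUTTON_TYPES and value:
            self.values.append(value)


def extract_button_values(html):
    """Return the button labels found in an HTML string, in document order."""
    parser = ButtonValueParser()
    parser.feed(html)
    parser.close()
    return parser.values


if __name__ == "__main__":
    page = '<form><input type="text" value="x"><input type="submit" value="Send"></form>'
    print(extract_button_values(page))
```

If something like this were added to html_text, it would probably sit behind an opt-in flag so that the default output doesn't change.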