Letractively / boilerpipe

Automatically exported from code.google.com/p/boilerpipe

Ignore FORM tags in HTMLHighlighter #45

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
When using HTMLHighlighter, boilerpipe sometimes keeps artifacts coming from 
FORM and LABEL tags.

This can be easily prevented by adding a new ignorable element to the 
TAG_ACTIONS map in HTMLHighlighter.java:

TAG_ACTIONS.put("FORM", TA_IGNORABLE_ELEMENT);
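To illustrate why that one line would be enough, here is a minimal, self-contained sketch of the idea. It is not boilerpipe's actual code: the `IgnorableTagSketch` class, the `Action` enum, the `visibleText` helper and the pre-tokenized input are all hypothetical stand-ins, and only the `TAG_ACTIONS` name and the "FORM" entry come from the report above. The assumption is that an element mapped to an "ignore" action is skipped together with everything nested inside it, so a LABEL inside a FORM would be dropped as well.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class IgnorableTagSketch {
    // Hypothetical stand-in for the library's tag actions, not its real types.
    enum Action { IGNORE }

    // Maps upper-case tag names to the action taken when the tag is seen.
    static final Map<String, Action> TAG_ACTIONS = new HashMap<>();
    static {
        TAG_ACTIONS.put("SCRIPT", Action.IGNORE);
        TAG_ACTIONS.put("STYLE", Action.IGNORE);
        TAG_ACTIONS.put("FORM", Action.IGNORE); // the entry proposed in this issue
    }

    /**
     * Joins the text tokens that lie outside ignorable elements.
     * Tokens are assumed to be either "<tag>", "</tag>" or plain text.
     */
    static String visibleText(String[] tokens) {
        StringBuilder out = new StringBuilder();
        int ignoreDepth = 0; // > 0 while inside at least one ignorable element
        for (String token : tokens) {
            if (token.startsWith("</")) {
                String name = token.substring(2, token.length() - 1).toUpperCase(Locale.ROOT);
                if (TAG_ACTIONS.get(name) == Action.IGNORE && ignoreDepth > 0) {
                    ignoreDepth--;
                }
            } else if (token.startsWith("<")) {
                String name = token.substring(1, token.length() - 1).toUpperCase(Locale.ROOT);
                if (TAG_ACTIONS.get(name) == Action.IGNORE) {
                    ignoreDepth++;
                }
            } else if (ignoreDepth == 0) {
                if (out.length() > 0) {
                    out.append(' ');
                }
                out.append(token);
            }
        }
        return out.toString();
    }
}
```

With this sketch, text inside `<form>…</form>`, including any nested label text, is left out of the result, while the surrounding text is kept.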

Original issue reported on code.google.com by xavi.beu...@gmail.com on 24 Mar 2012 at 6:40

GoogleCodeExporter commented 9 years ago

Original comment by ckkohl79 on 25 Mar 2012 at 2:12