janih / boilerpipe

Boilerplate Removal and Fulltext Extraction from HTML pages

Tags missing in output html #26

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
The problem is that the HTMLHighlighter process seems to omit opening heading 
tags (<H1>, <H2>, etc.) but still includes the closing tags (such as </H2>), so 
titles don't stand out in the output document.
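
A quick way to confirm this symptom on any output file is to count opening and closing heading tags separately. The sketch below is only an illustration using Python's standard library; it does not call boilerpipe, and the names `HeadingBalanceChecker` and `unbalanced_headings` are made up for this example. It assumes the highlighted HTML is already available as a string.

```python
from collections import Counter
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingBalanceChecker(HTMLParser):
    """Counts opening and closing heading tags separately (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.opened = Counter()
        self.closed = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <H2> arrives here as "h2".
        if tag in HEADING_TAGS:
            self.opened[tag] += 1

    def handle_endtag(self, tag):
        # Called for stray closing tags too, which is the case reported here.
        if tag in HEADING_TAGS:
            self.closed[tag] += 1


def unbalanced_headings(html):
    """Return {tag: (open_count, close_count)} for headings whose counts differ."""
    checker = HeadingBalanceChecker()
    checker.feed(html)
    checker.close()
    tags = set(checker.opened) | set(checker.closed)
    return {
        tag: (checker.opened[tag], checker.closed[tag])
        for tag in sorted(tags)
        if checker.opened[tag] != checker.closed[tag]
    }


if __name__ == "__main__":
    # Made-up sample mimicking the reported output: closing tag without an opener.
    sample = "Some title</H2><p>Body text</p>"
    print(unbalanced_headings(sample))
```

An empty result means every heading tag that was closed was also opened; any entry in the result points at a tag name worth inspecting in the output.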

What version of the product are you using? On what operating system?

boilerpipe 1.1.0 on Ubuntu 10.10

Please provide any additional information below.

Original issue reported on code.google.com by *nicho...@arachnys.com on 6 Jul 2011 at 2:11

GoogleCodeExporter commented 9 years ago
Thanks for reporting.
Please check again with the just-released version 1.2.0 and report whether it 
fixes the bug.

Original comment by ckkohl79 on 6 Jul 2011 at 2:51

GoogleCodeExporter commented 9 years ago
Checked with version 1.2.0, and the bug is fixed.

Thanks

Original comment by *nicho...@arachnys.com on 6 Jul 2011 at 3:36

GoogleCodeExporter commented 9 years ago
Cheers :)

Original comment by ckkohl79 on 6 Jul 2011 at 3:37