janih / boilerpipe

Boilerplate Removal and Fulltext Extraction from HTML pages
2 stars 0 forks source link

Patch for /trunk/boilerpipe-core/src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java #42

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Break after tagging a TextBlock a candidate title, there is no need to continue 
checking the rest of the potential titles for the current TextBlock.

Original issue reported on code.google.com by tucker...@gmail.com on 20 Mar 2012 at 8:08

Attachments:

GoogleCodeExporter commented 9 years ago
Thanks, good point.

Original comment by ckkohl79 on 21 Mar 2012 at 9:11