Open GoogleCodeExporter opened 8 years ago
here is what i did
ArticleExtractor EXTRACTOR = ArticleExtractor.getInstance();
HTMLHighlighter HH = HTMLHighlighter.newExtractingInstance();
InputSource inputSource = new InputSource(new StringInputStream(html));
TextDocument htmlDoc = new BoilerpipeSAXInput(inputSource).getTextDocument();
EXTRACTOR.process(htmlDoc);
html = HH.process(htmlDoc, html);
Original comment by tien.ngu...@sematext.com
on 28 Jun 2013 at 8:21
Original issue reported on code.google.com by
m...@ndthuan.com
on 30 Nov 2012 at 8:44