Open GoogleCodeExporter opened 8 years ago
Hi,
Save it as HTML page and use below lines to extract text as it is.
Reader r = new InputStreamReader(new FileInputStream("D:/test1.htm"));
String text = CommonExtractors.ARTICLE_EXTRACTOR.getText(r);
System.out.println("Text:"+text);
Regards,
Vanaja Jayaraman
Original comment by vanaja.u...@gmail.com
on 22 May 2014 at 12:01
Original issue reported on code.google.com by
ranjanba...@iblogee.com
on 2 Feb 2014 at 12:44