Closed asfimport closed 16 years ago
Chris M. Hostetter (@hossman) (migrated from JIRA)
There does not appear to be a bug here.
As the javadocs for this class state...
The HtmlDocument class creates a Lucene Document from an HTML document.
It does this by using JTidy package.
JTidy is then complaining about errors in your HTML document ... notably that it doesn't seem to be valid html.
Writing e-mail parser, and we are impeded by this error.
Migrated from LUCENE-1041 by DURGA DEEP, resolved Nov 01 2007 Environment: