Error while uploading document foo.html: NullPointerException: Cannot invoke "org.dkpro.core.api.xml.CasXmlHandler$StackFrame.isCaptureText()" because the return value of "java.util.Deque.peek()" is null
Looks like the importer doesn't like any type of characters outside the root <html>/</html> tags. I.e. removing the doctype and and potentially trailing empty lines and linebreaks makes the import work.
Describe the bug When I import
Error while uploading document foo.html: NullPointerException: Cannot invoke "org.dkpro.core.api.xml.CasXmlHandler$StackFrame.isCaptureText()" because the return value of "java.util.Deque.peek()" is null
Please complete the following information: