arcaputo3 closed 1 month ago
tika.parser.enableXMLOutput
@aamend FYI: this also seems to improve LLM use cases, for example with tabular PDFs.
Nice job! Thanks for the contrib