Hello,
In the Java version of Tika, in particular in the Tika GUI app, it is possible to display the extracted text (after conversion from PDF, DOCX, etc.) in several formats, such as "Formatted text", "Plain Text", "Main Content", and "Structured Text".
Is it possible to get these formats through tika-python?
Many thanks in advance.