I tried the text-extraction example with a .doc file that is converted from the document.docx (I attached my input document below), but I couldn't get any information from the file.
Expected Behavior
Can we get the text and other information like parsing a .docx file?
We should have something like this:
0
Text: Paragraph 1
Bold: false
Italic: false
........
FLATTENED:
Paragraph 1
Paragraph 2
Table 1
Column 1
Column 2
Row 1
Cell 1-1
Cell 1-2
Paragraph 3
Paragraph 4
Hi, I am a Text Box
Description
Expected Behavior
Actual Behavior
Input document: