Right now, tables in Word files are completely skipped from indexing. This PR unwraps any simple tables (no nested tables, no omitted cells) on a row-by-row basis to include them with the indexed text. Some assumptions are made along the way:
The first row of the table is the heading
Entity attributes are in the table columns, with each table row representing an isolated entity.
Each unwrapped table row may look as follows:
No.: 2
Issue: The CD doesn’t play
Comments: The CD doesn’t start playback upon insertion into the drive. Furthermore, the drive LED doesn’t turn on.
Right now, tables in Word files are completely skipped from indexing. This PR unwraps any simple tables (no nested tables, no omitted cells) on a row-by-row basis to include them with the indexed text. Some assumptions are made along the way:
Each unwrapped table row may look as follows: