Was trying to get pipeline_merge_tables working and ended up finding a small issue. The default validation function breaks when there are no tables in the current or next page, which means that the pipeline can't scan any pages after the fact for merging.
Was trying to get
pipeline_merge_tables
working and ended up finding a small issue. The default validation function breaks when there are no tables in the current or next page, which means that the pipeline can't scan any pages after the fact for merging.After poking around a bit I noticed that it's because of the
break
's here: https://github.com/aws-samples/amazon-textract-response-parser/blob/3ba9b666a7ae8ba849003512ccb0bb8f331e35bc/src-python/trp/t_tables.py#L102 https://github.com/aws-samples/amazon-textract-response-parser/blob/3ba9b666a7ae8ba849003512ccb0bb8f331e35bc/src-python/trp/t_tables.py#L107Opening a PR to fix this, but for now if you need a workaround just change these to
continue
locally