Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
7.4k stars 573 forks source link

feat(pptx): add coordinate metadata to PPTX elements #3248

Open scanny opened 1 week ago

scanny commented 1 week ago

Summary As a page (slide)-oriented document format, the position of each shape on its slide is specified in the document. This information could be transformed appropriately and included in the coordinates metadata for elements formed from PPTX documents.