Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
8.66k stars 707 forks source link

rfctr: clean MSG partitioner and tests as prep #3107

Closed scanny closed 4 months ago

scanny commented 4 months ago

Summary Fix type errors and generally prepare partition_msg() and its tests for refactoring to use python-oxmsg library instead of the problematic msg_parser library for partitioning Outlook MSG files.