Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0

chore: Add test that tests all the different file types in example-docs #3277

Open potter-potter opened 6 days ago

potter-potter commented 6 days ago

Adds a test that covers 23 different file types, essentially all the file types in example-docs. PDF is tested 3 times, which brings the total to 25 test cases.

A similar test, inspired by this one, will be added to core-product.
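
For readers who want a feel for what an "every file type in example-docs" test involves, here is a minimal sketch. It is not the code from this PR. The helper names `collect_example_files` and `check_all_file_types` are hypothetical, and the partitioning function is passed in as a parameter so the sketch stays runnable without the library installed. In this repository the natural candidate to pass in would be `partition` from `unstructured.partition.auto`, wrapped so that the path goes to its `filename` argument.

```python
from __future__ import annotations

from pathlib import Path
from typing import Callable, Iterable


def collect_example_files(directory: str) -> list[Path]:
    """Return every regular file directly inside `directory`, sorted by name."""
    return sorted(p for p in Path(directory).iterdir() if p.is_file())


def check_all_file_types(
    files: Iterable[Path],
    partition_fn: Callable[[str], list],
) -> dict[str, str]:
    """Run `partition_fn` on each file and return {file name: failure reason}.

    A file counts as a failure if `partition_fn` raises or returns no elements.
    An empty dict means every file was partitioned successfully.
    """
    failures: dict[str, str] = {}
    for path in files:
        try:
            elements = partition_fn(str(path))
        except Exception as exc:  # record the error and keep checking other files
            failures[path.name] = f"raised {type(exc).__name__}: {exc}"
            continue
        if not elements:
            failures[path.name] = "no elements returned"
    return failures
```

Collecting all failures before asserting, rather than stopping at the first one, means a single run reports every file type that is broken. With pytest, an alternative is to feed the output of `collect_example_files` to `pytest.mark.parametrize` so each file shows up as its own test case.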