Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
7.44k stars 580 forks source link

rfctr(html): drop convert_and_partition_html() #3215

Closed scanny closed 2 weeks ago

scanny commented 2 weeks ago

Summary Remove unstructured.partition.html.convert_and_partition_html(). Move file-type conversion (to HTML) responsibility to each brokering partitioner that uses that strategy and let them call partition_html() for themselves with the result.

Additional Context

Rationale: