Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
8.65k stars 704 forks source link

rfctr(html): clean html tests in prep for PRs to follow #3156

Closed scanny closed 3 months ago

scanny commented 3 months ago

Summary Clean tests_unstructured/partition/test_html.py in preparation for broader refactor of HTML partitioner to follow. That refactor will address a cluster of bugs.

Temporarily remove blank lines in tests so reordering tests in following commit is easier to follow. Those will go back in after that.