Unstructured-IO / unstructured-api

Apache License 2.0

fix: remove `parent_id` until backwards compat bug is addressed #252

Closed by awalker4 1 year ago

awalker4 commented 1 year ago

Original issue: https://github.com/Unstructured-IO/unstructured-api/issues/237

Core library fix: https://github.com/Unstructured-IO/unstructured/pull/1526

Anyone who calls `partition_via_api`, including LangChain users of `UnstructuredAPIFileLoader`, will hit this bug until they upgrade `unstructured`. The immediate fix is to remove `parent_id` from the hosted API. Next, we can make sure LangChain users are up to date. Finally, the core library fix above will handle any new fields going forward. It will be safe to re-add `parent_id` once users are generally on `unstructured>=0.10.15`.
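For illustration, here is a minimal sketch of the failure mode and of the two fixes described above. It assumes the older client builds its metadata object by passing every response key through as a keyword argument; `LegacyMetadata`, `from_dict_strict`, `from_dict_tolerant`, and `strip_parent_id` are hypothetical names for this example, not the actual `unstructured` or `unstructured-api` code.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict, List, Optional


@dataclass
class LegacyMetadata:
    """Hypothetical stand-in for an older client's metadata class."""

    filename: Optional[str] = None
    page_number: Optional[int] = None

    @classmethod
    def from_dict_strict(cls, data: Dict[str, Any]) -> "LegacyMetadata":
        # Assumed old behavior: every key becomes a keyword argument,
        # so an unknown key such as parent_id raises TypeError.
        return cls(**data)

    @classmethod
    def from_dict_tolerant(cls, data: Dict[str, Any]) -> "LegacyMetadata":
        # Assumed shape of a client-side fix: ignore keys the class does not know.
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in data.items() if k in known})


def strip_parent_id(elements: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Server-side workaround: drop parent_id from each element's metadata."""
    cleaned = []
    for element in elements:
        metadata = {
            key: value
            for key, value in element.get("metadata", {}).items()
            if key != "parent_id"
        }
        cleaned.append({**element, "metadata": metadata})
    return cleaned
```

Under these assumptions, stripping the field on the server lets the strict (old) client parse the response again, while the tolerant parser is the kind of change that would make re-adding the field safe later.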

awalker4 commented 1 year ago

Note: we should merge https://github.com/Unstructured-IO/unstructured-api/pull/249 before cutting the release, since it is another high-priority fix.