i-dot-ai / redbox

Bringing Generative AI to the way the Civil Service works
https://i-dot-ai.github.io/redbox/
MIT License
85 stars 28 forks source link

Feature/use unstructured local api #917

Closed gecBurton closed 1 month ago

gecBurton commented 1 month ago

Context

As an Engineer I want to use the official Unstructured image instead of our own code so that:

  1. our docker images are smaller
  2. we are writing the least code ourselves

Changes proposed in this pull request

  1. using unstructured api image
  2. worker calls unstructured image via http

out of scope:

  1. UnstructuredLargeChunkLoader & UnstructuredTitleLoader are nearly identical and should be merged

Guidance to review

Relevant links

build: https://github.com/i-dot-ai/redbox/actions/runs/10265038401

Things to check