DARPA-ASKEM / terarium

https://app.terarium.ai
Apache License 2.0
13 stars 2 forks source link

[TASK]: Cache Document extraction #4418

Open YohannParis opened 1 month ago

YohannParis commented 1 month ago

Describe the task

kbirk commented 3 weeks ago

This is a bit tricky based on how it is currently done. It isn't an operation that returns a response that can be cached. It's a very long method that does several incremental changes that involve updating an object in place such that values are available for subsequent processing.

The proper way to do this would require splitting the logic into one stateless method that returns a final response containing all information required, and then another method that takes that response and applies it to a document.

This will let us cache the responses and then apply it to documents with the same input.

YohannParis commented 1 week ago

@dvince2 @dgauldie how would you like to proceed on this? I think it's important to do the caching for that purpose.

YohannParis commented 1 week ago

Per discussion with @dgauldie: