ICIJ / datashare

A self-hosted search engine for documents.
https://datashare.icij.org
GNU Affero General Public License v3.0
598 stars 54 forks source link

feat: embedded artifacts #1536

Closed bamthomas closed 2 months ago

bamthomas commented 2 months ago

See #1165 for the list of commits. Changes are made in extract-lib. https://github.com/ICIJ/extract/commit/f09254eb0d05c54c8df657acf73641d3193e2f5b

If null is provided in the EmbeddedDocumentExtractor as the artifactPath then memory is used, else the provided path will be used as file cache for embedded files.

pirhoo commented 2 months ago

Well done!