Unstructured-IO / unstructured-api-tools

Apache License 2.0
28 stars 10 forks source link

Ability to accept gzip compressed files #107 #153

Closed kravetsmic closed 1 year ago

kravetsmic commented 1 year ago

Added supporting gzip-compressed files. Users can send gzip files, mixed with non-gzip files and the API will decompress files and return normal results.

Updated templates for API, and added function for decompressing .gz files. Updated tests: added .gz files to tests.

Bumped version to 0.9.0


Closes issue #107

cragwolfe commented 1 year ago

@kravetsmic ,actually, this doesn't quite close https://github.com/Unstructured-IO/unstructured-api-tools/issues/107 , where the spec also specifies the ability for an optional parameter to be posted with files, e.g. -F 'gz_uncompressed_content_type=applicaton/pdf.

See the last 3 examples in https://github.com/Unstructured-IO/community/blob/main/specs/Pipeline-API-gz-spec.md .

But, it's natural for this to be added into an additional PR, so I merged this as is. Thanks!