chrismattmann / tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Apache License 2.0

Allow raw /rmeta output #375

Closed: tballison closed this issue 1 year ago

tballison commented 1 year ago

The parse option calls the /rmeta endpoint and then merges the metadata keys from all of the returned records together, so users can't tell which metadata belongs to the primary file and which belongs to each embedded file.

Would it be possible to warn users about the current behavior and offer an option to get the raw JSON from the /rmeta endpoint?
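
To make the request concrete, here is a rough sketch of the difference. It is not tika-python's code: `fetch_rmeta`, `split_rmeta`, and `conflate` are hypothetical helper names, the server URL assumes a Tika server on its default port 9998, and `conflate` only approximates the kind of key merging described above. The raw /rmeta response is a JSON list with one metadata object per document, where the first entry is the container file and the rest are embedded files.

```python
import json
import urllib.request


def fetch_rmeta(path, server="http://localhost:9998"):
    """PUT a file to a running Tika server's /rmeta endpoint.

    Returns the raw JSON list as-is. The server URL is an assumption
    (Tika server's default port); adjust it for your setup.
    """
    with open(path, "rb") as fh:
        payload = fh.read()
    req = urllib.request.Request(
        server + "/rmeta",
        data=payload,
        method="PUT",
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def split_rmeta(records):
    """Keep the container's metadata apart from the embedded files' metadata."""
    if not records:
        return {}, []
    return records[0], list(records[1:])


def conflate(records):
    """Illustrative only: merge every record's keys into one dict.

    After this step there is no way to tell which value came from
    which file, which is the problem this issue describes.
    """
    merged = {}
    for record in records:
        for key, value in record.items():
            merged.setdefault(key, []).append(value)
    return merged


# Made-up sample shaped like an /rmeta response for a zip with one text file.
SAMPLE = [
    {"Content-Type": "application/zip", "resourceName": "bundle.zip"},
    {
        "Content-Type": "text/plain",
        "resourceName": "a.txt",
        "X-TIKA:embedded_resource_path": "/a.txt",
    },
]

primary, embedded = split_rmeta(SAMPLE)
merged = conflate(SAMPLE)
```

With the raw list, `primary` and `embedded` keep each file's metadata separate. In `merged`, the two `Content-Type` values sit side by side with nothing tying them back to `bundle.zip` or `a.txt`.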

chrismattmann commented 1 year ago

Will test/evaluate as part of the next release (not 1.24.2, which I will release today, but 2.6.x, which I will release next week).

chrismattmann commented 1 year ago

Fixed in https://github.com/chrismattmann/tika-python/commit/ca2a692193307feec4277142f88fd1afd819f140