chrismattmann / tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Apache License 2.0
1.51k stars 236 forks source link

Updated tika to use sha1 hash instead of md5 for checksum #399

Open griffin-rickle opened 1 year ago

griffin-rickle commented 1 year ago

SHA1 has been deprecated in FIPS and there are suggested steps to move away from the algorithm, but it is still supported for "Non-digital-signature applications" (CTRL-F for SHA-1 to find the supported usages table in the PDF). The maven repository supplying the Tika JAR only provides md5 and sha1 checksums, or an .asc file for verifying the signature of the file (which is not the same as verifying the checksum), so I have implemented and update to use sha1 instead of md5.

chrismattmann commented 1 year ago

Thank you for this @griffin-rickle let me take a look. Is there a way to do this in a back compat way (see my comments on #348 )

griffin-rickle commented 1 year ago

Thank you for this @griffin-rickle let me take a look. Is there a way to do this in a back compat way (see my comments on #348 )

Not sure if you saw my comment on the original Issue thread, but this has been done. Let me know if there's anything else I could do!