machawk1 / ArchiveThumbnails

An implementation for Ahmed AlSum's "Thumbnail Summarization Techniques for Web Archives"
GNU General Public License v2.0
7 stars 5 forks source link

Use hashing mechanism other than md5 for unique thumbnails #108

Open machawk1 opened 7 years ago

machawk1 commented 7 years ago

Though this tool does not necessarily deal with security, we use md5 hashing to distinguish unique URIs, as they're more portable than passing around a URI as a parameter (encoding/decoding would be needed). The recent news of sha1 collisions warrants investigating using another hashing algorithm (e.g., BLAKE2) for distinguishing URIs.