eregs / regulations-parser

Parser for U.S. federal regulations and other regulatory information
Creative Commons Zero v1.0 Universal

Web API: Answer question about hashing to avoid duplicate files. #303

Open tadhg-ohiggins opened 8 years ago

tadhg-ohiggins commented 8 years ago

Currently, the Web API hashes each uploaded file, compares that hash against the hashes of files already present, and rejects the upload if it matches one of them.

The idea is that this will prevent users from inadvertently adding the same file multiple times, and help them realize when files with different names are actually the same file. (I confused myself with exactly this mistake when using the parser at the command line, which is what prompted the feature.)
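
For anyone weighing in, here is a rough sketch of the kind of check described above. It is not the Web API's actual code: `FileStore`, `DuplicateFileError`, the in-memory dictionary, and the choice of SHA-256 are all assumptions made for illustration.

```python
import hashlib


class DuplicateFileError(Exception):
    """Raised when an upload's contents match a file already stored."""


class FileStore:
    """Hypothetical in-memory stand-in for the Web API's file storage."""

    def __init__(self):
        # Maps content hash -> name of the first file stored with that content.
        self._names_by_hash = {}

    def add(self, filename, contents):
        """Store an upload, rejecting it if identical bytes were already added."""
        # Hash the bytes, not the name, so renamed copies are still caught.
        # SHA-256 is an assumption; the issue does not name the algorithm.
        digest = hashlib.sha256(contents).hexdigest()
        existing = self._names_by_hash.get(digest)
        if existing is not None:
            raise DuplicateFileError(
                f"{filename!r} has the same contents as {existing!r}"
            )
        self._names_by_hash[digest] = filename
        return digest
```

With this sketch, adding `b"<rule/>"` as `a.xml` and then again as `b.xml` raises `DuplicateFileError` on the second call, while a file with different contents is accepted regardless of its name.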

tadhg-ohiggins commented 8 years ago

Addressed in https://github.com/eregs/regulations-parser/pull/317