worldveil / dejavu

Audio fingerprinting and recognition in Python
MIT License
6.36k stars 1.43k forks source link

make fingerprinted .wav file storage optional #4

Closed pguridi closed 10 years ago

pguridi commented 10 years ago

after fingerprinting a wav file (generated from a mp3), the wav file is not needed anymore. For matching the fingerprint, the original mp3 is enough. This way, a lot of space can be saved..

for example, I have a 10.000 songs in mp3 collection that i want to fingerprint, and keeping the wav files wastes a lot of space. After generating each fingerprint, I could just remove the wav file.

worldveil commented 10 years ago

Agreed. Seems like a good use case. Perhaps the dejavu.fingerprint() function should take a boolean argument and keep/remove the derivative wave files based on that. Would you be interested in adding this?

pguridi commented 10 years ago

sure, I'll work on this and #3 and send a pull request. btw, can you explain where does the "confidence" value comes from, in the readme.md?.(and what exactly means)

worldveil commented 10 years ago

great!

the confidence value is the number of hashes that not only matched, but were "aligned" as well. See "Fingerprint Alignment" for exactly what that means.