Open GoogleCodeExporter opened 9 years ago
I've attached the .awk script, in case you are interested.
Original comment by dv...@ukr.net
on 16 Oct 2012 at 5:36
Attachments:
Any comments or advice, please?..
Original comment by dv...@ukr.net
on 23 Jan 2013 at 10:27
Any comments or advice, please?..
Original comment by dv...@ukr.net
on 12 Jun 2013 at 11:48
You should use fuzzy hash (google this: ssdeep ) for binary and text files.
Comparing sound or image files is a different story.
Original comment by mgr.inz....@gmail.com
on 22 Jul 2013 at 10:53
Thanks, but ssdeep does not seem to do what I expect. As an example, let's take
autoruns.exe (649864 bytes) and
autorunsc.exe (567944 bytes) from SysInternals - these are GUI and Console
version of the same program. When executing "jdiff.exe autoruns.exe
autorunsc.exe a.patch" (JDIFF - Jojo's binary diff), it produces 'a.patch' with
a size of 190457 bytes. So, raw similarity of these two files can be calculated
as (1 - 190457/567944) = 0.66466 = 66.466%.
In the same time, ssdeep shows 0. I think it's because ssdeep does not detect
moved blocks whereas both jdiff and xdelta does.
Original comment by dv...@ukr.net
on 13 Aug 2013 at 9:09
Original issue reported on code.google.com by
dv...@ukr.net
on 15 Oct 2012 at 7:42