googleprojectzero / functionsimsearch

Some C++ example code to demonstrate how to perform code similarity searches using SimHashing.
Apache License 2.0
559 stars 97 forks source link

Avoid adding functions of same name to the repulsion dataset #18

Open MohamadMansouri opened 5 years ago

MohamadMansouri commented 5 years ago

This was discussed here https://github.com/googleprojectzero/functionsimsearch/issues/17

Note: I am using the shell to do base64 decoding, I did this since I saw you reporting some problems with the python base64 method. This is costing some time. If the decoder of the base64 python3 library doesn't have such problems lets use it instead.

googlebot commented 5 years ago

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

:memo: Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here (e.g. I signed it!) and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers
MohamadMansouri commented 5 years ago

I signed it!

googlebot commented 5 years ago

CLAs look good, thanks!