dsp-uga / elizabeth

Scalable malware detection
MIT License
0 stars 0 forks source link

Fix issue where every data file got the same id #5

Closed zachdj closed 6 years ago

zachdj commented 6 years ago

I'm using an extra function to create a new scope for the id variable in each iteration. See #4 for more details.

Another potential fix is to include id as a default parameter in the lambda function like data = [rdd.map(lambda x, id=id: (x, id)) for id, rdd in data.items()] but that seems more hacky.

This pull request also includes some changes to .gitignore because I wasn't sure how to exclude them :frowning_face: