dsp-uga / elizabeth

Scalable malware detection
MIT License
0 stars 0 forks source link

Add preprocess for asm files #8

Closed cbarrick closed 6 years ago

cbarrick commented 6 years ago

This parses the asm files into an RDD of the form RDD[id, (segment, addr, bytes, opcode, rest)].

The benefit of using the asm files is that the bytes are split at instruction boundaries, but I've yet to find a representation that clusters similar instructions. It's at least a good starting point for working with asm.