We use a combination of non-public, internal information as well as a number of static rules and analyses to obtain the ground truth labels.
but what is the actual accuracy?
For example, the file "5e939818321bcd64cab2f711bb273c0d51479b08fb0f1371d39a6c88a294b02b" has a packed label of 0, but analysis shows that it is packed with UPX and can be unpacked.
I think the correct packed label for this file should be 1.
This dataset is labeled
but what is the actual accuracy? For example, the file "5e939818321bcd64cab2f711bb273c0d51479b08fb0f1371d39a6c88a294b02b" has a packed label of 0, but analysis shows that it is packed with UPX and can be unpacked. I think the correct packed label for this file should be 1.