mr-ma / sip-eval

1 stars 0 forks source link

We need a table in tex format that dumps some info about programs in the dataset #30

Open mr-ma opened 6 years ago

mr-ma commented 6 years ago

The table should include number of (LLVM) instructions, percentage of input dependent instructions, percentage of input independent, programming language (c or c++), line-of-code for all the programs in the dataset. Use tabulate library for dumping tex files, see measure-protection-coverage.py

anahitH commented 6 years ago

To generate table first run dataset_info.sh bash script. It will dump statistics files for input dep/indep coverage. Statistics file contains number of LLVM instructions too. Then run dataset_info.py python script to create the table. Programming languages and source code lines of the programs are hardcoded in the python script.

mr-ma commented 6 years ago

@anahitH Is line of code correct? I think you're just counting the line of code in the test files, you should take the library code into account as well, just like the number of LLVM instructions.

anahitH commented 6 years ago

@mr-ma, you'r right. For the games the lines number is correct as there are no statically linked libraries. I'll update numbers for the rest of programs.