ganler / ResearchReading

General system research material (not limited to paper) reading notes.
GNU General Public License v3.0
20 stars 1 forks source link

TACO'2020 | IR2Vec: LLVM IR Based Scalable Program Embeddings #58

Closed ganler closed 3 years ago

ganler commented 3 years ago

https://dl.acm.org/doi/pdf/10.1145/3418463

ganler commented 3 years ago

Task: convert LLVM-IR into embedding vectors.

image

Symbolic level

Flow-aware (w/o GNN structure)

    - opcode vector, type vector, and {variable0, 1, 2, ...} (all values from the triplets)

In addition, add instruction embedding according to USE-DEF chain;

image

Result

Faster (training time) & higher acc.

Key design choices

GitHub: https://github.com/IITH-Compilers/IR2Vec