WM-SEMERU / ds4se

Data Science for Software Engineering (ds4se) is an academic initiative to perform exploratory and causal inference analysis on software engineering artifacts and metadata. Data Management, Analysis, and Benchmarking for DL and Traceability.
https://wm-csci-435-f19.github.io/ds4se/
Apache License 2.0
7 stars 3 forks source link

Develop potential outcome graphical model with preprocessing as intervention #111

Open scheurich-sarah opened 3 years ago

scheurich-sarah commented 3 years ago

Preprocessing as a binary treatment: you have two treatments and we can observe the counterfactual by applying both kinds of preprocessing to the data

treatment 0 (control): processed with word2vec

Causal effect evaluates what would happen if you use doc2vec instead of word2vec For any individual requirement, we can observe

We compute the average of this result for every requirement ACE=E[Y_1 ]−E[Y_0 ]

109