WM-SEMERU / ds4se

Data Science for Software Engineering (ds4se) is an academic initiative to perform exploratory and causal inference analysis on software engineering artifacts and metadata. Data Management, Analysis, and Benchmarking for DL and Traceability.
https://wm-csci-435-f19.github.io/ds4se/
Apache License 2.0
7 stars 3 forks source link

Learn how to call the estimate_effect method of the CausalModel class #106

Open scheurich-sarah opened 3 years ago

scheurich-sarah commented 3 years ago

Estimating the effect is the third step in the causal process defined in the doWhy documentation. Learn the difference between the identification methods (Regression, Stratification, Matching,Weighting, Instrumental Variable, Regression Discontinuity) and which may be most applicable to our data. Learn the keyword args.

Reference that uses all available estimation types: https://microsoft.github.io/dowhy/example_notebooks/dowhy_estimation_methods.html

103

scheurich-sarah commented 3 years ago

Implemented vanilla causal framework on a built-in doWhy data set. All built in data sets come with structural causal models already encoded. This implementation follows the starter notebook ConfoundingExample on the doWhy website. This starter notebook seemed most relevant as it uses regression and explores data looking for confounders. Documented nuances about the estimate_effect method of CausalModel class from exploring source code.