py-why / dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
https://www.pywhy.org/dowhy
MIT License
6.88k stars 916 forks source link

Add a modified version of the auto MPG dataset #1190

Closed bloebp closed 4 weeks ago

bloebp commented 1 month ago

Before, we were loading it directly from the UCI dataset repository. However, this cause connection issues and build fails. The dataset is now added directly with its respective license.

Further, removed the output of the causal discovery example notebook to ensure that it gets compiled while building.

bloebp commented 1 month ago

thanks for adding this, @bloebp. Right now, the license file is in the root directory with many other example notebooks. Would it be cleaner to create a new directory called datasets/auto-mpg or and then add both the csv and license file in that sub-folder? The only code change will be to specify the correct path in the notebook.

Makes sense! We probably should even move all datasets into that folder. But that should be a follow-up PR.