amiratag / DataShapley

Data Shapley: Equitable Valuation of Data for Machine Learning
MIT License
256 stars 66 forks source link

Question in TMC #2

Closed yangc22 closed 5 years ago

yangc22 commented 5 years ago

https://github.com/amiratag/DataShapley/blob/7d64ad67f2f40ec07fb9a72fc7a37201487c8af7/DShap.py#L255 Hi,

why do we restart the model when the second conditional flow is met?

Thanks

Bee-zest commented 5 years ago

it'll save your data and other stuff in some directory. You should delete them before re-run the code.

tabularML commented 5 years ago

It's to force the users to define a restart_model() method if the user is going to use a custom classifier to prevent possible bugs.

yangc22 commented 5 years ago

got it, thanks so much!