amiratag / DataShapley

Data Shapley: Equitable Valuation of Data for Machine Learning
MIT License

Show performance graph by gradually removing worst values instead of best values #8

Closed rbunn80110 closed 4 years ago

rbunn80110 commented 4 years ago

The example notebook generates a performance graph by gradually removing the best data points. I'm trying to see how the accuracy changes when the worst-performing data points are gradually removed instead. I can't work out where in the code the data points are removed for the chart, or how to reverse the order.

Thanks,

Bob

tabularML commented 4 years ago

You can use the same 'performance_plots' method but feed it negated values, so that the lowest-valued points are removed first (or use '_portion_performance' directly).
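
To illustrate why negating the values reverses the removal order, here is a minimal standalone sketch. It does not call the DataShapley code: `removal_order`, `removal_curve`, and `toy_score` are hypothetical helpers written for this example, and it assumes (as the reply above implies) that the plot removes points from highest value to lowest.

```python
import numpy as np

def removal_order(values):
    """Indices sorted from highest value to lowest ("remove best first").

    Hypothetical helper; assumed to mirror the ordering the plot uses.
    """
    return np.argsort(-np.asarray(values, dtype=float), kind="stable")

def removal_curve(values, score_fn, step=1):
    """Score the points that remain after removing 0, step, 2*step, ... points."""
    order = removal_order(values)
    return [score_fn(order[k:]) for k in range(0, len(order), step)]

# Made-up per-point values, for illustration only.
values = np.array([0.30, -0.20, 0.05, -0.50, 0.10])

best_first = removal_order(values)    # highest-valued points go first
worst_first = removal_order(-values)  # negation flips it: lowest-valued go first

def toy_score(keep):
    # Stand-in for "retrain on the kept points and evaluate":
    # here simply the sum of the kept points' values.
    return float(values[keep].sum())

worst_curve = removal_curve(-values, toy_score)
print(best_first, worst_first, worst_curve)
```

In the real code the score would come from retraining the model on the remaining points; the only change needed for the "remove worst first" curve is the sign of the values passed in.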