decile-team / cords

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.
https://cords.readthedocs.io/en/latest/
MIT License
316 stars 53 forks

For GRAD_MATCH method, the weights associated with each data point in X(subset of training set) #33

Closed. lishaguo closed this issue 2 years ago

lishaguo commented 2 years ago
  1. In the GRAD-MATCH method, a weight is associated with each data point in X (the selected subset of the training set). Do these weights have a physical meaning? For example, does a higher weight mean that the corresponding selected data point makes a greater contribution to the residual?
  2. During the iterations, the newly selected index is already in the set of selected indices, so the iteration breaks. Why does this happen? Thanks, @krishnatejakk.
krishnatejakk commented 2 years ago

I have included a new version of OMPSolver for GradMatch that fixes this issue.
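
For readers following the two questions above, here is a minimal sketch of generic orthogonal matching pursuit (OMP) for gradient matching. It is an illustration under stated assumptions, not the CORDS `OMPSolver` and not necessarily what the fix does. The function name `omp_select` and the parameters `budget`, `lam`, and `tol` are hypothetical. Each column of `A` is assumed to be one sample's gradient and `b` the target (full) gradient. The weights scale each selected gradient in the sum that approximates `b`, so for gradients of comparable norm a larger weight means a larger share of the approximation. The sketch masks already-selected indices so an index cannot be picked twice. With an exact, unregularized least-squares refit the residual is orthogonal to every selected column, so re-selection would typically arise only when the refit is inexact (for example under regularization or a nonnegativity constraint) or when the remaining correlations are numerically zero.

```python
import numpy as np


def omp_select(A, b, budget, lam=0.0, tol=1e-10):
    """Hypothetical OMP sketch (not the CORDS API).

    A      : (d, n) array, column i is the gradient of sample i (assumption).
    b      : (d,) target gradient to approximate.
    budget : maximum number of samples to select.
    lam    : ridge regularization strength for the weight refit.
    tol    : stop once the residual norm drops below this value.
    """
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = A.shape[1]

    selected = []                       # indices chosen so far, in order
    taken = np.zeros(n, dtype=bool)     # mask of already-selected indices
    weights = np.zeros(0)
    residual = b.copy()

    while len(selected) < budget and np.linalg.norm(residual) > tol:
        # Correlation of every candidate gradient with the current residual.
        scores = A.T @ residual
        # Mask out indices that are already selected so they cannot repeat.
        scores[taken] = -np.inf
        j = int(np.argmax(scores))
        if scores[j] <= 0:
            # No remaining candidate reduces the residual; stop early.
            break

        selected.append(j)
        taken[j] = True

        # Refit all weights on the selected columns (ridge least squares).
        A_s = A[:, selected]
        gram = A_s.T @ A_s + lam * np.eye(len(selected))
        weights = np.linalg.solve(gram, A_s.T @ b)
        residual = b - A_s @ weights

    return selected, weights


# Toy usage: four orthogonal "gradients", target built from samples 1 and 3.
A_demo = np.eye(4)
b_demo = np.array([0.0, 3.0, 0.0, 1.0])
idx_demo, w_demo = omp_select(A_demo, b_demo, budget=2)
print(idx_demo, w_demo)
```

In the toy usage the sample whose gradient carries the larger share of `b_demo` receives the larger weight. A nonnegative variant would replace the refit with a nonnegative least-squares solve, which is where a refit can leave the residual correlated with an already-selected column.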