NorskRegnesentral / shapr

Explaining the output of machine learning models with more accurately estimated Shapley values
https://norskregnesentral.github.io/shapr/
Other
138 stars 32 forks source link

Set n_batches by default #327

Closed martinju closed 1 year ago

martinju commented 1 year ago

Introduces n_batches = NULL by default in explain(). The larger n_batches is, the less memory is consumed, but the computation time also increases slightly. Setting n_batches = NULL (the default) now chooses a reasonable default with a trade-off between computation speed and memory consumption. This default behavior will be updated later when a full performance comparison is ran, also including number of cores when parallelizing. Note that n_batches is also used for parallelization and for progressbar.

Other edits

TODO in this PR: