Reduce end-to-end training time from days to hours (or hours to minutes) and cut energy requirements and costs by an order of magnitude using coresets and data selection.
Run subset selection experiments on synthetic data, with detailed visualizations of the selected subsets and of the test performance of models trained on those subsets.
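To make the idea of a synthetic-data subset selection experiment concrete, here is a minimal, illustrative sketch. It is not this project's API: every name in it (`make_blobs`, `k_center_greedy`, `fit_centroids`, `run_experiment`) is hypothetical. Greedy k-center selection stands in for a coreset method and a nearest-centroid classifier stands in for the model. It uses only the Python standard library (Python 3.8+ for `math.dist`).

```python
import math
import random


def make_blobs(n_per_class, centers, spread, rng):
    """Synthetic 2-D data: one Gaussian blob per class (hypothetical helper)."""
    points, labels = [], []
    for label, (cx, cy) in enumerate(centers):
        for _ in range(n_per_class):
            points.append((rng.gauss(cx, spread), rng.gauss(cy, spread)))
            labels.append(label)
    return points, labels


def k_center_greedy(points, k, rng):
    """Greedy k-center: repeatedly add the point farthest from the chosen set."""
    first = rng.randrange(len(points))
    selected = [first]
    # Distance from every point to its nearest selected point so far.
    min_dist = [math.dist(p, points[first]) for p in points]
    while len(selected) < k:
        nxt = max(range(len(points)), key=min_dist.__getitem__)
        selected.append(nxt)
        min_dist = [min(d, math.dist(p, points[nxt]))
                    for d, p in zip(min_dist, points)]
    return selected


def fit_centroids(points, labels):
    """Stand-in 'model': one mean point per class."""
    sums = {}
    for (x, y), c in zip(points, labels):
        sx, sy, n = sums.get(c, (0.0, 0.0, 0))
        sums[c] = (sx + x, sy + y, n + 1)
    return {c: (sx / n, sy / n) for c, (sx, sy, n) in sums.items()}


def accuracy(centroids, points, labels):
    """Fraction of points whose nearest class centroid matches the label."""
    correct = 0
    for p, c in zip(points, labels):
        pred = min(centroids, key=lambda cls: math.dist(p, centroids[cls]))
        correct += pred == c
    return correct / len(points)


def run_experiment(seed=0, subset_size=20):
    """Train on the full set, a random subset, and a k-center subset."""
    rng = random.Random(seed)
    centers = [(-4.0, 0.0), (4.0, 0.0)]  # assumed toy configuration
    train_x, train_y = make_blobs(200, centers, 1.0, rng)
    test_x, test_y = make_blobs(100, centers, 1.0, rng)

    subsets = {
        "full": list(range(len(train_x))),
        "random": rng.sample(range(len(train_x)), subset_size),
        "k_center": k_center_greedy(train_x, subset_size, rng),
    }
    results = {}
    for name, idx in subsets.items():
        model = fit_centroids([train_x[i] for i in idx],
                              [train_y[i] for i in idx])
        results[name] = accuracy(model, test_x, test_y)
    return results


if __name__ == "__main__":
    for name, acc in run_experiment().items():
        print(f"{name:>8}: test accuracy = {acc:.3f}")
```

The indices returned by a selection routine such as `k_center_greedy` can be passed to any plotting tool to highlight the selected points against the full training set, which is the kind of visualization described above. On a toy problem this easy, all three training sets should give similar test accuracy; differences between selection strategies would be expected to show up on harder or noisier data.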