Examine data for calibration drift contributions to labeling and model/test in reversed orders

Consider the following situation:

A particular sample of soil is run.
Due to sensor drift, adhesions, temperature, or other hardware-related factors, one of the measured parameters begins steadily increasing.
A particular sample of microdebitage is then run.

In this case, the classifier may appear to perform better than it should, because the model can rely on the drift in one of the measured features rather than on real differences between the particle types. The microdebitage may have larger (or smaller) values purely because of systematic error from the measurement device.
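The effect can be illustrated with a small simulation. This is only a sketch on synthetic data, not the project's pipeline: the function names, the drift rate, and the single Gaussian feature are all assumptions made for illustration. Both classes are drawn from the same distribution, so any accuracy above chance comes from the drift alone.

```python
import random

def simulate_run(n_per_class, drift_per_particle, seed=0):
    """Simulate one run: soil first, then microdebitage.

    Hypothetical setup: both classes share the SAME true distribution,
    and the instrument adds a drift that grows with run order.
    """
    rng = random.Random(seed)
    values, labels = [], []
    for i in range(2 * n_per_class):
        label = 0 if i < n_per_class else 1  # 0 = soil, 1 = microdebitage
        true_value = rng.gauss(10.0, 1.0)
        values.append(true_value + drift_per_particle * i)
        labels.append(label)
    return values, labels

def threshold_accuracy(values, labels):
    """Accuracy of a one-feature classifier split midway between class means."""
    soil = [v for v, l in zip(values, labels) if l == 0]
    micro = [v for v, l in zip(values, labels) if l == 1]
    mean_soil = sum(soil) / len(soil)
    mean_micro = sum(micro) / len(micro)
    threshold = (mean_soil + mean_micro) / 2
    micro_is_high = mean_micro > mean_soil
    correct = sum(
        ((v > threshold) == micro_is_high) == (l == 1)
        for v, l in zip(values, labels)
    )
    return correct / len(values)

# Same particles statistically, with and without instrument drift.
acc_with_drift = threshold_accuracy(*simulate_run(500, drift_per_particle=0.02))
acc_without_drift = threshold_accuracy(*simulate_run(500, drift_per_particle=0.0))
print("accuracy with drift:   ", acc_with_drift)
print("accuracy without drift:", acc_without_drift)
```

With drift, the classifier separates the two "classes" well even though the particles are statistically identical. Without drift, it falls back to roughly chance.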

To guard against this, some approaches are:

Determine whether the measurements drift over the course of a run. Do all of the particles seem to be getting larger over time? Would this make sense from the point of view of the sieves/operation?
Try building a model from data collected by running the soil first and then the microdebitage, and measure its performance. Is the performance the same if the microdebitage is run first and then the soil?
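Both checks can be sketched on synthetic data. This is a hedged illustration, not the project's code: `drift_slope`, `simulate_run`, `fit_threshold`, `accuracy`, the drift rate, and the single-feature threshold model are all hypothetical stand-ins. The first check fits a least-squares slope of a measured feature against run order. The second trains on a soil-first run and evaluates on a microdebitage-first run.

```python
import random

def drift_slope(values):
    """Least-squares slope of a measured feature against run order (index)."""
    n = len(values)
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    sxy = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    sxx = sum((i - mean_x) ** 2 for i in range(n))
    return sxy / sxx

def simulate_run(order, n_per_class=500, drift=0.02, seed=0):
    """Synthetic run: classes are measured in `order`, e.g. (0, 1) = soil first.

    Assumption: both classes share one true distribution, so only the
    instrument drift (which grows with run order) distinguishes them.
    """
    rng = random.Random(seed)
    values, labels = [], []
    for i in range(2 * n_per_class):
        labels.append(order[0] if i < n_per_class else order[1])
        values.append(rng.gauss(10.0, 1.0) + drift * i)
    return values, labels

def fit_threshold(values, labels):
    """Fit a one-feature model: split midway between the class means."""
    soil = [v for v, l in zip(values, labels) if l == 0]
    micro = [v for v, l in zip(values, labels) if l == 1]
    mean_soil, mean_micro = sum(soil) / len(soil), sum(micro) / len(micro)
    return (mean_soil + mean_micro) / 2, mean_micro > mean_soil

def accuracy(model, values, labels):
    """Fraction of particles the fitted threshold model labels correctly."""
    threshold, micro_is_high = model
    correct = sum(
        ((v > threshold) == micro_is_high) == (l == 1)
        for v, l in zip(values, labels)
    )
    return correct / len(values)

# Check 1: is there a trend in the feature over the course of the run?
train_values, train_labels = simulate_run(order=(0, 1), seed=1)
slope = drift_slope(train_values)

# Check 2: train soil-first, then test on same-order and reversed-order runs.
model = fit_threshold(train_values, train_labels)
acc_same_order = accuracy(model, *simulate_run(order=(0, 1), seed=2))
acc_reversed = accuracy(model, *simulate_run(order=(1, 0), seed=3))

print("slope per particle:", slope)
print("accuracy, same run order:    ", acc_same_order)
print("accuracy, reversed run order:", acc_reversed)
```

A clearly nonzero slope, together with a large gap between the same-order and reversed-order accuracies, would suggest the model is learning run order rather than particle type. On real data, the slope would need to be judged against the expected effect of the sieves and the instrument's operation.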
