Closed michaelneale closed 4 years ago
What I am trying to capture is the need for MLOps to validate and prevent these cases (which I think is an MLOps problem). Either by gating the training data or at the least the model.
I think I will close this as the other PR combined the challenge and requirement together
Note: Need a challenge section added that online learning sits outside of the normal pipeline of train, test, deploy.. this PR could evolve to that. even if are retraining at runtime, perhaps there should be a feedback of the state of the model to the CI/CD environment for continuous verification (a kind of monitoring).
will open in a new PR
Added a challenge section on reinforcement learning and the "swearing problem"