yeomko22 / TIL

Today I learned
1 stars 0 forks source link

Monitoring Machine Learning Models in Production #93

Open yeomko22 opened 2 years ago

yeomko22 commented 2 years ago

1. The ML system life cycle

스크린샷 2021-09-09 오후 7 00 38

  1. Model building

    • understanding the problem
    • data preparation
    • feature engineering
    • initial code, rough jupyter notebook
  2. Model Evaluation and Experimentation

    • feature selection
    • hyper paramter tuning
    • comparing the effectiveness of different algorithms on the given problem
    • notebooks with stats, graph evaluating feature weights, accuracy, precision, Receiver Operating Characteristics
  3. Productionize Model

    • preparing it so it can be deployed
    • production grade code
    • different programming language and framework
  4. Testing

    • ensuring that production code behaves in the way we expect it to
    • test cases
  5. Deployment

    • API for accessing the model
  6. Monitoring and Observability

    • ensure our model is doing what we expect it to in production.
yeomko22 commented 2 years ago

2. What makes ML system monitoring hard

yeomko22 commented 2 years ago

3. Why you need monitoring

data skew

Model Staleness

Negative Feedback Loops

yeomko22 commented 2 years ago

4. Key Principles for monitoring your ML system

yeomko22 commented 2 years ago

5. Understanding the spectrum of ML Risk Management

스크린샷 2021-09-09 오후 10 00 46
yeomko22 commented 2 years ago

6. Data Science Monitoring

model input monitoring

model prediction monitoring

advanced techniques

yeomko22 commented 2 years ago

7. Operations Monitoring Concerns

Operational concern around ML system

Three pilars of observability

Monitoring vs Observability

yeomko22 commented 2 years ago

8. Bringing Ops & DS Together - Metrics with Prometheus & Grafana

Pros

Cons

Metrics for Machine Learning

yeomko22 commented 2 years ago

9. Bringing Ops & DS Together - Logs

Pros

Cons