YunchaoYang / Blogs

blogs and notes, https://yunchaoyang.github.io/blogs/

0 stars 0 forks source link

MLOps #27

Open YunchaoYang opened 1 year ago

YunchaoYang commented 1 year ago

Outline

This post will elaborate on how to effectively use Machine Learning infrastructure. Namely, the best practice in building, maintaining and scaling production-ready deep learning systems.

0) Build a production ready deep learning pipeline

Tensoflow extended (TFX) pytorch ? Nvidia ?

1) Kubernetes with Google Cloud: Deploy your Deep Learning model effortlessly.

2) Scalability in ML

3) Docker containers and Docker Compost

4) uWSGI Nginx serving a Tensorflow model to users with Flask, uWSGI as a web server and Nginx as a reverse proxy.

5) Deploy a Deep Learning model as a web application with Flask

references:

https://theaisummer.com/topics/mlops/

YunchaoYang commented 1 year ago

The MLOps projects are roughly categorized into the following:

Training Orchestration
Model Monitoring
Model Testing
Model Serving
Data Versioning
Feature Store
Experiment Tracking
Explainability

Another different categorization strategy:

Automates ML workflow
CI/CD for ML
cron job scheduler: Tools for monitoring cron jobs, a command line job scheduler on Unix like operating system
Data Catalog
Data Enrichment
Data Exploration
Data Management
Data Processing
Data Validation
Data Visualization
Feature Engineering
Feature Store
Hyperparameter Tuning
Knowledge Sharing
Machine Learning Platform
and so on

YunchaoYang commented 1 year ago

develop ML products and rapidly bring them into production.

automate and operationalize ML products

concept drift/data drift

YunchaoYang commented 1 year ago

key ideas:

1. Model metadata storage and management 2. Data and pipeline versioning 3. Hyperparameter tuning 4. Run orchestration and workflow pipelines 5. Model deployment and serving 6. Production model monitoring