remla23-team2 / model-training

Contains the ML training pipeline
MIT License
0 stars 0 forks source link

Model-training

Contains the ML training pipeline. This repository is dedicated to storing all training-related files for the Machine Learning (ML) part of the course.

Instructions

Clone the Repository

Create a local copy of this repository by cloning it 

git clone https://github.com/remla23-team2/model-training.git

Install Requirements

Run the following commands from your terminal in the application folder:

pip install -r requirements.txt

Run the main

In your terminal, run the following commands:

python main.py

Docker

Open the terminal (in the application folder) and run the following commands to create a Docker image:

docker build -t ghcr.io/remla23-team2/model-training:VERSION .

DVC DAG

 +----------+  
 | get_data |  
 +----------+  
       *       
       *       
       *       
+------------+ 
| preprocess | 
+------------+ 
       *       
       *       
       *       
  +-------+    
  | train |    
  +-------+
       *
       *
       *
 +----------+
 | evaluate |
 +----------+

DVC DAG Outputs

+----------------------------+ 
| ..\output\getdata\data.tsv |
+----------------------------+
               *
               *
               *
   +----------------------+
   | ..\output\preprocess |
   +----------------------+
               *
               *
               *
      +-----------------+
      | ..\output\train |
      +-----------------+
               *
               *
               *
    +--------------------+
    | ..\output\evaluate |
    +--------------------+

Code Coverage Report

badge

badge