Contains the ML training pipeline. This repository is dedicated to storing all training-related files for the Machine Learning (ML) part of the course.
Create a local copy of this repository by cloning it
git clone https://github.com/remla23-team2/model-training.git
Run the following commands from your terminal in the application folder:
pip install -r requirements.txt
In your terminal, run the following commands:
python main.py
Open the terminal (in the application folder) and run the following commands to create a Docker image:
docker build -t ghcr.io/remla23-team2/model-training:VERSION .
+----------+
| get_data |
+----------+
*
*
*
+------------+
| preprocess |
+------------+
*
*
*
+-------+
| train |
+-------+
*
*
*
+----------+
| evaluate |
+----------+
+----------------------------+
| ..\output\getdata\data.tsv |
+----------------------------+
*
*
*
+----------------------+
| ..\output\preprocess |
+----------------------+
*
*
*
+-----------------+
| ..\output\train |
+-----------------+
*
*
*
+--------------------+
| ..\output\evaluate |
+--------------------+