REMLA24-Team-5 / Model-Training

The model-training repository contains all the code required to train and test a phishing URL detection machine learning model. It includes scripts for data preprocessing, model training, evaluation, and performance testing to ensure robust and accurate model development.
0 stars 0 forks source link

Set dataset up in the remote dvc #18

Closed blibliboe closed 2 months ago

blibliboe commented 2 months ago

Added the functionality to not download the database but have the pipeline do that for you

Timdnb commented 2 months ago

Shouldn't we also include the dataset as a dependency for the dvc pipeline here?

blibliboe commented 2 months ago

Shouldn't we also include the dataset as a dependency for the dvc pipeline here?

Yes, you are right, changed the way the dataset was gotten remotely, is now connected to the dvc pipeline instead of the code