GeroVanMi / algorithmic-quartet-mlops

A showcase Machine Learning Operations (MLOps) Project.

Define Dataset for training #1

Closed GeroVanMi closed 6 months ago

GeroVanMi commented 6 months ago

To train a diffusion model we need a dataset. The data can either be static or continuously added.

Patrickliuu commented 6 months ago

DVC for data versioning currently makes no sense, as we are not trying to re-train the model.

GeroVanMi commented 6 months ago

@Patrickliuu It seems you have decided on the Pokemon data, is that right?

Patrickliuu commented 6 months ago

The dataset is available under: huggan/pokemon
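
For reference, a minimal sketch of how the dataset could be pulled in for training, assuming the Hugging Face `datasets` library is installed and that the dataset exposes a `train` split (neither is confirmed in this thread). The helper names `dataset_url` and `load_pokemon` are hypothetical and only for illustration.

```python
# Dataset identifier as given in this issue.
DATASET_ID = "huggan/pokemon"


def dataset_url(dataset_id: str) -> str:
    """Build the Hugging Face Hub page URL for a dataset identifier."""
    return f"https://huggingface.co/datasets/{dataset_id}"


def load_pokemon(split: str = "train"):
    """Download (or reuse the local cache of) the dataset.

    Assumption: a split named "train" exists; adjust if it does not.
    The import is done lazily so this module can be imported
    without the `datasets` package being installed.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


if __name__ == "__main__":
    print("Dataset page:", dataset_url(DATASET_ID))
    ds = load_pokemon()
    # Inspect the size and columns before wiring it into training.
    print(ds)
```

Since the data is static for now, pinning it by name like this is probably enough; versioning can be revisited if re-training becomes a goal.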