eth-easl / modyn

Modyn is a research platform for training ML models on growing datasets.
MIT License

Add option to clean up database and temp files after pipeline #577

Open MaxiBoether opened 1 month ago

MaxiBoether commented 1 month ago

When running many experiments where we are only interested in the pipeline log, it would help to have an option that removes all information from the database and deletes temporary files (e.g., models) after the pipeline finishes, to save disk space. For example, when running Criteo experiments, the huge DLRM snapshots sometimes filled up the disk.
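
To make the request concrete, here is a minimal sketch of what the temp-file half of such an option could look like. It does not use Modyn's actual code: `pipeline_workspace`, `cleanup_after_run`, and `run_demo` are hypothetical names invented for illustration, and database cleanup is not shown. The idea is only that temporary artifacts live in a scratch directory that is deleted once the run ends, while the pipeline log is stored outside it and survives.

```python
import shutil
import tempfile
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def pipeline_workspace(cleanup_after_run: bool = True):
    """Yield a scratch directory for one pipeline run (hypothetical helper).

    If cleanup_after_run is True, the scratch directory and everything in it
    (e.g., model snapshots) is deleted on exit, even if the run raised.
    """
    scratch = Path(tempfile.mkdtemp(prefix="pipeline_"))
    try:
        yield scratch
    finally:
        if cleanup_after_run:
            shutil.rmtree(scratch, ignore_errors=True)


def run_demo(base_dir: Path, cleanup_after_run: bool) -> tuple[bool, bool]:
    """Simulate a run; return (log still exists, scratch dir still exists)."""
    # The log lives outside the scratch directory, so cleanup never touches it.
    log_path = base_dir / "pipeline.log"
    with pipeline_workspace(cleanup_after_run) as scratch:
        # Stand-in for a large model snapshot written during training.
        (scratch / "model_snapshot.bin").write_bytes(b"\0" * 1024)
        log_path.write_text("pipeline finished\n")
    return log_path.exists(), scratch.exists()
```

In this sketch the cleanup runs in a `finally` block, so disk space is also freed when an experiment crashes partway through. Whether that is desirable (versus keeping artifacts of failed runs for debugging) would be a design decision for the actual option.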