Generate_train_data problem

CDECatapult / ml-performance-prediction

Code that accompanies the paper "Predicting the Computational Cost of Deep Learning Models"

Apache License 2.0

20 stars 11 forks source link

Hi @s9013xx,

Sorry for the late response.

I worked on the same project for my research as well, data generation needs to be done using tensorflow's docker 1.10.1-gpu image to avoid issues. Also, the parameters used to generate data in this project are extreme (they were originally intended to run for long weeks), hence I'd recommend you reduce the values of following parameters (in the arguments or the benchmark scripts themselves) to stop memory issues:

Batch size
Input and output dimensions
Matrix size (very important to reduce)
Number of tests (num_val)
Number of test repetitions (repetitions)
Number of warmup tests (iter_warmup)
Number of benchmarks (iter_benchmarks)

I'd also recommend you take a look at earlier commits and the mlpredict repository (https://github.com/CDECatapult/mlpredict) if you want to see the model itself.

If you need any other help feel free to contact me, but I don't guarantee responses since I'm busy with other commitments.

CDECatapult / ml-performance-prediction

Generate_train_data problem #1