Open DhvananShah-Reflektion opened 6 years ago
Could you please add some details:
CPU Model : 2.2 GHz Intel Core i7 Is BLAS impl same as the Blas Vendor? Blas Vendor: [OPENBLAS] If this is not the one, how can I find out the BLAS impl?
full model name please On Thu, Jun 28, 2018 at 13:40 Dhvanan Shah notifications@github.com wrote:
CPU Model : 2.2 GHz Intel Core i7 Is BLAS impl same as the Blas Vendor? Blas Vendor: [OPENBLAS] If this is not the one, how can I find out the BLAS impl?
— You are receiving this because you commented.
Reply to this email directly, view it on GitHub https://github.com/deeplearning4j/deeplearning4j/issues/5746#issuecomment-401166360, or mute the thread https://github.com/notifications/unsubscribe-auth/ALru_0P_wSkTnUX906ENNRYarOn3DDs_ks5uBT9UgaJpZM4U8CM0 .
CPU Model : Intel(R) Core(TM) i7-4770HQ CPU @ 2.20GHz
Sounds about right to me. You could use DL4J 1.0.0-SNAPSHOT and MKL to get a bit better performance though.
We did some quick timing runs on my PC earlier (Windows + 8 core 5960x @ 4Ghz)... around 700ms for MKL. So we do want to look into this a bit more - might be some scope for improvement.
I am trying to run image feature extraction on a pre-trained VGG16 model using the model zoo. After loading the model and passing my images through the model, the feed forward step -
Map<String, INDArray> stringINDArrayMap = vgg16.feedForward(image, false);
is taking roughly ~1000ms for a single image. I just wanted to know if this kind of time is expected for running an image through the feed forward step. Is there anything I am doing wrong, or can change any configurations to increase the resources given to it to decrease execution time.Code:
Env : OS : Mac OS X CPU Model : 2.2 GHz Intel Core i7 Blas vendor: [OPENBLAS] o.n.l.a.o.e.DefaultOpExecutioner - Cores: [8]; Memory: [3.6GB];
Aha! Link: https://skymindai.aha.io/features/DL4J-93