NVIDIA / caffe

Caffe: a fast open framework for deep learning.
http://caffe.berkeleyvision.org/
Other
672 stars 263 forks source link

Slow compile time #502

Closed dashesy closed 6 years ago

dashesy commented 6 years ago

The compile time of NVIDIA/caffe (Cuda9 and CUDNN 7) is much slower than BVLC/caffe. In both cases I compile with "All" architectures because I do not know which one may be running on the target cluster. Is this expected because of new architectures?

drnikolaev commented 6 years ago

Hi @dashesy , it takes about 1m45s to build it on my box. Yes, it is slower to build than BVLC because of fp16 support - you actually build both versions, fp32 and fp16.

dashesy commented 6 years ago

Is there a way to only compile for float32?

drnikolaev commented 6 years ago

Not at this moment, sorry