BVLC / caffe

Caffe: a fast open framework for deep learning.
http://caffe.berkeleyvision.org/
Other
33.96k stars 18.72k forks source link

caffe time -model -weights -gpu=0 #7050

Open everjcc opened 2 years ago

everjcc commented 2 years ago

caffe time -gpu

Issue summary

caffe time -model=xxx -weighs=xxx -gpu=0 the log is: I0312 15:29:30.427956 2367 caffe.cpp:406] Average time per layer: I0312 15:29:30.427961 2367 caffe.cpp:409] data forward: 0.0018944 ms. I0312 15:29:30.427969 2367 caffe.cpp:412] data backward: 0.0018848 ms. I0312 15:29:30.427975 2367 caffe.cpp:409] conv1 forward: 0.10807 ms. I0312 15:29:30.427982 2367 caffe.cpp:412] conv1 backward: 0.182646 ms. I0312 15:29:30.427989 2367 caffe.cpp:409] relu1 forward: 0.0140288 ms. I0312 15:29:30.427994 2367 caffe.cpp:412] relu1 backward: 0.0018432 ms. I0312 15:29:30.428000 2367 caffe.cpp:409] norm1 forward: 0.0628864 ms. I0312 15:29:30.428007 2367 caffe.cpp:412] norm1 backward: 0.105226 ms. I0312 15:29:30.428014 2367 caffe.cpp:409] pool1 forward: 0.0158592 ms. I0312 15:29:30.428020 2367 caffe.cpp:412] pool1 backward: 0.0018784 ms. I0312 15:29:30.428027 2367 caffe.cpp:409] conv2 forward: 0.291235 ms. I0312 15:29:30.428033 2367 caffe.cpp:412] conv2 backward: 0.515402 ms. I0312 15:29:30.428040 2367 caffe.cpp:409] relu2 forward: 0.0101152 ms. I0312 15:29:30.428048 2367 caffe.cpp:412] relu2 backward: 0.0018592 ms. I0312 15:29:30.428056 2367 caffe.cpp:409] norm2 forward: 0.137219 ms. I0312 15:29:30.428066 2367 caffe.cpp:412] norm2 backward: 0.256826 ms. I0312 15:29:30.428073 2367 caffe.cpp:409] pool2 forward: 0.0133536 ms. I0312 15:29:30.428084 2367 caffe.cpp:412] pool2 backward: 0.0024768 ms. I0312 15:29:30.428092 2367 caffe.cpp:409] conv3 forward: 0.14239 ms. I0312 15:29:30.428098 2367 caffe.cpp:412] conv3 backward: 0.3532 ms. I0312 15:29:30.428107 2367 caffe.cpp:409] relu3 forward: 0.008976 ms. I0312 15:29:30.428114 2367 caffe.cpp:412] relu3 backward: 0.0020128 ms. I0312 15:29:30.428123 2367 caffe.cpp:409] conv4 forward: 0.117597 ms. I0312 15:29:30.428130 2367 caffe.cpp:412] conv4 backward: 0.292886 ms. I0312 15:29:30.428138 2367 caffe.cpp:409] relu4 forward: 0.0090048 ms. I0312 15:29:30.428145 2367 caffe.cpp:412] relu4 backward: 0.001872 ms. I0312 15:29:30.428153 2367 caffe.cpp:409] conv5 forward: 0.109824 ms. I0312 15:29:30.428160 2367 caffe.cpp:412] conv5 backward: 0.368051 ms. I0312 15:29:30.428165 2367 caffe.cpp:409] relu5 forward: 0.0088512 ms. I0312 15:29:30.428174 2367 caffe.cpp:412] relu5 backward: 0.0018848 ms. I0312 15:29:30.428182 2367 caffe.cpp:409] pool5 forward: 0.0117792 ms. I0312 15:29:30.428189 2367 caffe.cpp:412] pool5 backward: 0.00256 ms. I0312 15:29:30.428197 2367 caffe.cpp:409] fc6 forward: 0.417875 ms. I0312 15:29:30.428205 2367 caffe.cpp:412] fc6 backward: 3.15267 ms. I0312 15:29:30.428212 2367 caffe.cpp:409] relu6 forward: 0.0122656 ms. I0312 15:29:30.428264 2367 caffe.cpp:412] relu6 backward: 0.0018912 ms. I0312 15:29:30.428273 2367 caffe.cpp:409] drop6 forward: 0.0127136 ms. I0312 15:29:30.428282 2367 caffe.cpp:412] drop6 backward: 0.001856 ms. I0312 15:29:30.428292 2367 caffe.cpp:409] fc7 forward: 0.1988 ms. I0312 15:29:30.428300 2367 caffe.cpp:412] fc7 backward: 2.72682 ms. I0312 15:29:30.428308 2367 caffe.cpp:409] relu7 forward: 0.0122848 ms. I0312 15:29:30.428316 2367 caffe.cpp:412] relu7 backward: 0.0019136 ms. I0312 15:29:30.428328 2367 caffe.cpp:409] drop7 forward: 0.0126016 ms. I0312 15:29:30.428339 2367 caffe.cpp:412] drop7 backward: 0.0018944 ms. I0312 15:29:30.428347 2367 caffe.cpp:409] fc8 forward: 0.109283 ms. I0312 15:29:30.428378 2367 caffe.cpp:412] fc8 backward: 2.68584 ms. I0312 15:29:30.428388 2367 caffe.cpp:409] prob forward: 0.0146496 ms. I0312 15:29:30.428395 2367 caffe.cpp:412] prob backward: 0.0018528 ms. I0312 15:29:30.428421 2367 caffe.cpp:417] Average Forward pass: 55.8925 ms. I0312 15:29:30.428429 2367 caffe.cpp:419] Average Backward pass: 65.4428 ms. I0312 15:29:30.430272 2367 caffe.cpp:421] Average Forward-Backward: 127.954 ms. I0312 15:29:30.430285 2367 caffe.cpp:423] Total Time: 1279.54 ms. I0312 15:29:30.430291 2367 caffe.cpp:424] Benchmark ends

The sum of forward_time_per_layer is not equal to the average forward pass(2.01ms < 55.89ms) , please help me to solve it, thanks very much.