Tensorrt get whole output distributions

YaYaB commented 4 years ago

Configuration

Version of DeepDetect:
- Locally compiled on Ubuntu 18.04 LTS
Commit (shown by the server when starting):
- Branch Master, 6d6c79aaf43171a93dba38ba79ac5f0207f21c71
GPUS:
- 1 x Nvidia GTX 1080Ti

Your question / the problem you're facing:

I have an issue getting the whole distribution of predictions using Tensorrt. I took the model available on dd's website and named age_real https://deepdetect.com/models/init/desktop/images/classification/age_real.tar.gz

Error message (if any) / steps to reproduce the problem:

Download the model

mkdir age_real && cd age_real
wget https://deepdetect.com/models/init/desktop/images/classification/age_real.tar.gz
tar -xvf age_real.tar.gz

Let's try this model with the caffe backend.
Launch Dede

Api call

./dede --port 8080

Serveur log output

DeepDetect [ commit 6d6c79aaf43171a93dba38ba79ac5f0207f21c71 ]
[2020-03-31 17:39:39.638] [api] [info] Running DeepDetect HTTP server on localhost:8080

Create service Api call


curl -X PUT "http://localhost:8080/services/age" -d '{
   "mllib":"caffe",
   "description":"object detection service",
   "type":"supervised",
   "parameters":{
     "input":{
       "connector":"image",
       "height": 224,
       "width": 224
     },
     "mllib":{
       "nclasses":100,
       "gpu": true,
       "gpuid": 0,
       "net":{
        "test_batch_size": 1
       }
     }
   },
   "model":{
     "repository":"PATH_TO/age_real
   }
 }'

Serveur log output

{"status":{"code":201,"msg":"Created"}}

- Create Prediction

Api call

curl -X POST "http://localhost:8080/predict" -d '{ "service":"age", "parameters":{ "input":{ "width":224, "height":224 }, "output":{ "best": -1 }, "mllib":{ "gpu": true, "gpuid":0 } }, "data":["https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"] }'


Serveur log output:


Here I got the whole distribution of the predictions using the flag "mllib.best" to -1.
Now let's try to do the same with Tensorrt v5.1.

- Launch Dede

Api call

./dede --port 8080

Serveur log output

DeepDetect [ commit 6d6c79aaf43171a93dba38ba79ac5f0207f21c71 ] [2020-03-31 17:39:39.638] [api] [info] Running DeepDetect HTTP server on localhost:8080


- Create service
Api call

curl -X PUT "http://localhost:8080/services/age" -d '{ "mllib":"tensorrt", "description":"object detection service", "type":"supervised", "parameters":{ "input":{ "connector":"image", "height": 224, "width": 224 }, "mllib":{ "datatype": "fp32", "maxBatchSize": 1, "maxWorkspaceSize": 6096, "tensorRTEngineFile": "TRTengine_bs", "gpuid":0 } }, "model":{ "repository":"/mnt/terabox/research/age-classification/models/yaya/age" } }'

Serveur log output

{"status":{"code":201,"msg":"Created"}}

- Create Prediction

Api call

curl -X POST "http://localhost:8080/predict" -d '{ "service":"age", "parameters":{ "input":{ "width":224, "height":224 }, "output":{ "best": -1 }, "mllib":{ "gpu": true, "gpuid":0 } }, "data":["https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"] }'


Serveur log output:

{"status":{"code":200,"msg":"OK"},"head":{"method":"/predict","service":"age","time":642.0},"body":{"predictions":[{"classes":[],"uri":"https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"}]}}


As you can see I get empty prediction. However If I remove the "mllib.best" I get the best_match.

{"status":{"code":200,"msg":"OK"},"head":{"method":"/predict","service":"age","time":612.0},"body":{"predictions":[{"classes":[{"last":true,"cat":"64","prob":0.039911042898893359}],"uri":"https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"}]}}


Now if I try putting "mllib.best" to 1 or another value here is what I get a result very different with the category 0 with a low probability:

{"status":{"code":200,"msg":"OK"},"head":{"method":"/predict","service":"age","time":2921.0},"body":{"predictions":[{"classes":[{"last":true,"cat":"0","prob":0.000175380046130158}],"uri":"https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"}]}}



I would like to get the whole distribution but it seems that the element "mllib.best" does not work as it should.

beniz commented 4 years ago

Can you try "best":0 ? I believe we have a wrong test against 0 instad of < 0.

beniz commented 4 years ago

Actually the pathway is wrong in tensorrlib.cc. @fantes maybe I can take this, it should go through the supervised connector instead.

fantes commented 4 years ago

what do you mean "the pathway is wrong" ?

fantes commented 4 years ago

you mean it should not be filtered in tensorrlib and instead the supervisedouputconnector should do it?

fantes commented 4 years ago

in this case it seems the only thing to do is to remove code from tensorrtlib, i can handle it, i am tired of fighting against torch/c++ :)

YaYaB commented 4 years ago

Can you try "best":0 ? I believe we have a wrong test against 0 instad of < 0.

Yep it gives empty prediction

{"status":{"code":200,"msg":"OK"},"head":{"method":"/predict","service":"age","time":918.0},"body":{"predictions":[{"classes":[],"uri":"https://images.unsplash.com/photo-1580128660010-fd027e1e587a?ixlib=rb-1.2.1&ixid=eyJhcHBfaWQiOjEyMDd9&auto=format&fit=crop&w=500&q=60"}]}}

fantes commented 4 years ago

Hi this should be fixed by : https://github.com/jolibrain/deepdetect/pull/720 @YaYaB thank you a lot for the very precise bug report, it helps a lot for testing :)

YaYaB commented 4 years ago

Great, anytime :) I'll test it tonight and close the issue if it resolves everything on my side!

YaYaB commented 4 years ago

It fixes the issue on my side (tried with best equals to -1, 1 and several larger values)

jolibrain / deepdetect