Closed whatdhack closed 6 years ago
@whatdhack, The mode of preprocess_input
is not related to the backend. It stands for the library used in the original repository (e.g., VGG: caffe).
@taehoonlee , that is a good start. Can put more details in your answers. Your answers are cryptic ( and incomplete) . Looks like you are closing these issues for the sake of closing.
@whatdhack, The mode of preprocess_input
is originated from each model's original repository:
caffe
: 'RGB'->'BGR' followed by x -= [103.939, 116.779, 123.68]
(e.g., VGG16
and VGG19
),tf
: x /= 127.5
followed by x -= 1
(e.g., InceptionV3
and InceptionResNetV2
),torch
: x /= 255.
followed by x -= [0.485, 0.456, 0.406]
and x /= [0.229, 0.224, 0.225]
(e.g., DenseNet121
and DenseNet169
).If you are trying to deploy the pretrained networks, it is not an option to modify the mode designated for each model. Otherwise, you can choose another mode but it is not so recommended.
@taehoonlee , for the pre-trained models (e.g. VGG16) , it needs to be documented what modes are applicable. otherwise it takes time to run experiments and figure out what modes are available. For VGG16 it looks like, the pre-trained model can only handle 'caffe' mode.
A related question is the support across different backend uniform (i.e. does the 'caffe' mode work across TF, PyTorch , CNTK , etc . ) ?
@whatdhack, It doesn't need to be documented because end-users will not directly call imagenet_utils.preprocess_input
. And, the mode of preprocess_input
is about only normalization and orthogonal to the backend (i.e., caffe
mode works on all the backends).
@taehoonlee Really ? All the Keras applications examples use preprocess_input . See for yourself - vgg16
@whatdhack, Users will use a wrapper function preprocess_input(x)
rather than a imagenet_utils.preprocess_input(x, data_format, mode)
. The example also presents x = preprocess_input(x)
.
@taehoonlee , I am not sure I can take your word on this one as you did not even know usage of process_input() before I pointed that out. Documentation on preprocess_input() will clarify the confusion.
:)
Seems legit, but the naming is awfully confusing for someone looking into the code (for exploration or debugging purposes)
@whatdhack, The mode of
preprocess_input
is originated from each model's original repository:
caffe
: 'RGB'->'BGR' followed byx -= [103.939, 116.779, 123.68]
(e.g.,VGG16
andVGG19
),tf
:x /= 127.5
followed byx -= 1
(e.g.,InceptionV3
andInceptionResNetV2
),torch
:x /= 255.
followed byx -= [0.485, 0.456, 0.406]
andx /= [0.229, 0.224, 0.225]
(e.g.,DenseNet121
andDenseNet169
).If you are trying to deploy the pretrained networks, it is not an option to modify the mode designated for each model. Otherwise, you can choose another mode but it is not so recommended.
Wondering if the mode needs to be explicitly set to the backend I am using. For example in VGG16, preprocess_input () seems to pick the 'caffe' mode if I leave it unset , even if though my backend is Tensorflow. How would choosing one over the other would impact inference of the pre-trained VGG16 model ?