Tencent / PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
https://pocketflow.github.io

ChannelPrunedLearner doesn't compress the model #190

Open as754770178 opened 5 years ago

as754770178 commented 5 years ago

I pruned resnet_20 on cifar_10 with ChannelPrunedLearner, but when I read original_model.ckpt, pruned_model.ckpt, and best_model.ckpt, the weight shapes are all the same. Below, "origin val" is the weight shape in original_model.ckpt, and "pruned val" is the weight shape in pruned_model.ckpt / best_model.ckpt.

model/resnet_model/conv2d_8/kernel
origin val: [1, 1, 16, 32]
pruned val: [1, 1, 16, 32]
model/resnet_model/batch_normalization_2/gamma
origin val: [16]
pruned val: [16]
model/resnet_model/batch_normalization_18/moving_mean
origin val: [64]
pruned val: [64]
model/resnet_model/batch_normalization_6/beta
origin val: [16]
pruned val: [16]

Command: ./scripts/run_local.sh nets/resnet_at_cifar10_run.py --learner channel
Model: https://api.ai.tencent.com/pocketflow/models_resnet_20_at_cifar_10.tar.gz
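One quick way to run this comparison is to dump each checkpoint's variable-to-shape map (in TF 1.x, `tf.train.NewCheckpointReader(ckpt_path).get_variable_to_shape_map()`) and diff the two dicts. The sketch below uses hard-coded dicts with the shapes reported above in place of the checkpoint readers, so only the diffing logic is shown:

```python
# Hypothetical shape maps; in practice each would come from
# tf.train.NewCheckpointReader(ckpt_path).get_variable_to_shape_map().
origin = {
    'model/resnet_model/conv2d_8/kernel': [1, 1, 16, 32],
    'model/resnet_model/batch_normalization_2/gamma': [16],
}
pruned = {
    'model/resnet_model/conv2d_8/kernel': [1, 1, 16, 32],
    'model/resnet_model/batch_normalization_2/gamma': [16],
}

def changed_shapes(origin, pruned):
    """Return variables whose shape differs between the two checkpoints."""
    return {name: (origin[name], pruned[name])
            for name in origin
            if name in pruned and origin[name] != pruned[name]}

print(changed_shapes(origin, pruned))  # {} -> no variable was resized
```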

jiaxiang-wu commented 5 years ago

ChannelPrunedLearner does not remove pruned channels from the *.ckpt files. Instead, it only sets the corresponding channels' weights to zero. The channels are actually removed in the *.pb & *.tflite models. Therefore, the size of weights in the checkpoints should stay the same.
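The distinction can be illustrated with NumPy: in the checkpoint the pruned output channels are merely zeroed, so the kernel keeps its [1, 1, 16, 32] shape, while the exported model drops those channels entirely. This is a sketch of the idea, not PocketFlow's actual export code:

```python
import numpy as np

# A conv kernel shaped [kh, kw, in_channels, out_channels], as stored in the ckpt.
kernel = np.ones((1, 1, 16, 32))

# Channel pruning in the ckpt: zero out half of the output channels in place.
kernel[:, :, :, 16:] = 0.0
print(kernel.shape)   # (1, 1, 16, 32) -- shape unchanged, channels only zeroed

# Export to *.pb / *.tflite: keep only output channels that are not all-zero.
keep = ~np.all(kernel == 0.0, axis=(0, 1, 2))
compact = kernel[:, :, :, keep]
print(compact.shape)  # (1, 1, 16, 16) -- channels actually removed
```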

as754770178 commented 5 years ago

Can you tell me where the model is converted? I didn't find the *.pb models in cp_best_path or cp_channel_pruned_path.

jiaxiang-wu commented 5 years ago

.pb & .tflite models are generated using tools/conversion/export_chn_pruned_tflite_model.py.

as754770178 commented 5 years ago

In __build_pruned_train_model, the input and output collections are train_images and logits, as the code below shows:

      logits = tf.get_collection('logits')[0]
      train_images = tf.get_collection('train_images')[0]
      train_labels = tf.get_collection('train_labels')[0]
      mem_images = tf.get_collection('mem_images')[0]
      mem_labels = tf.get_collection('mem_labels')[0]

In tools/conversion/export_chn_pruned_tflite_model.py, should the parameters input_coll and output_coll be set to train_images and logits?
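The collection lookup the exporter relies on can be mimicked with a plain dict; the registry below is a toy stand-in for TensorFlow's graph collections (not the real tf API), and the string values stand in for actual tensors:

```python
# Toy stand-in for TF graph collections: name -> list of values.
registry = {}

def add_to_collection(name, value):
    registry.setdefault(name, []).append(value)

def get_collection(name):
    return registry.get(name, [])

# Mimic what __build_pruned_train_model stores under these names.
add_to_collection('train_images', 'input_tensor')
add_to_collection('logits', 'output_tensor')

# A converter given input_coll / output_coll names resolves them like this:
input_tensor = get_collection('train_images')[0]
output_tensor = get_collection('logits')[0]
print(input_tensor, output_tensor)
```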

jiaxiang-wu commented 5 years ago

You need to export the *.ckpt saved from the evaluation graph (defined in __build_pruned_evaluate_model()), instead of the training graph.

as754770178 commented 5 years ago
  1. If the structure of the model has not changed, can the line self.saver_train = tf.train.import_meta_graph(path + '.meta') in __build_pruned_train_model be replaced with

     # model definition
      with tf.variable_scope(self.model_scope):
        # forward pass
        logits = self.forward_train(mem_images)
        loss, accuracy = self.calc_loss(mem_labels, logits, self.trainable_vars)
        self.accuracy_keys = list(accuracy.keys())
        for key in self.accuracy_keys:
          tf.add_to_collection(key, accuracy[key])
        tf.add_to_collection('loss', loss)
        tf.add_to_collection('logits', logits)
  2. The eval model is only used in __train_pruned_model; does that make self.__build(is_train=False) in __init__ unnecessary?

jiaxiang-wu commented 5 years ago

To answer your two questions, we need to review the detailed implementation of ChannelPrunedLearner in the next few days, and refactor it if needed.