666DZY666 / micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape
MIT License
2.2k stars 478 forks source link

你好,请问能对Inception这种网络进行压缩吗? #8

Open 1448643857 opened 4 years ago

1448643857 commented 4 years ago

感谢作者能够提供这么好的代码给我们参考。

666DZY666 commented 4 years ago

可以,更改网络结构就行(不过压缩的时候根据网络结构要微调,比如有1X1,3X3,5X5的filter,1X1 filter压缩的应该要少一些)

1448643857 commented 4 years ago

好的,太感谢啦

1448643857 commented 4 years ago

作者你好,我还有一个疑问,就是诸如inception-v3,v4这种网络,里面含有7x1和1x7这种卷积核,请问这种怎么对其进行通道剪枝呢,因为两者之间是相互联系的0.0

Blue-Eagle-10 commented 4 years ago

@666DZY666 ,您好!

感谢您的分享,请问这个工程可用于YOLOv3这种目标检测模型的压缩么?如果可以,我需要从哪部分修改呢?谢谢!