NorbertZheng / read-papers

My paper reading notes.
MIT License
8 stars 0 forks source link

Sik-Ho Tang | Review: SqueezeNet (Image Classification). #99

Closed NorbertZheng closed 1 year ago

NorbertZheng commented 1 year ago

Sik-Ho Tang. Review: SqueezeNet (Image Classification).

NorbertZheng commented 1 year ago

Overview

In this story, SqueezeNet, by DeepScale, UC Berkeley and Stanford University, is reviewed. With equivalent accuracy, smaller CNN architectures offer at least three advantages:

This is a technical report on arXiv in 2016 with over 1100 citations.

NorbertZheng commented 1 year ago

Architectural Design Strategies

Strategy 1. Replace 3×3 filters with 1×1 filters

Given a budget of a certain number of convolution filters, we can choose to make the majority of these filters 1×1, since a 1×1 filter has 9× fewer parameters than a 3×3 filter.

Strategy 2. Decrease the number of input channels to 3×3 filters

Consider a convolution layer that is comprised entirely of 3×3 filters. The total quantity of parameters in this layer is:

We can decrease the number of input channels to 3×3 filters using squeeze layers, mentioned in the next section.

Strategy 3. Downsample late in the network so that convolution layers have large activation maps

The intuition is that large activation maps (due to delayed downsampling) can lead to higher classification accuracy.

Summary

NorbertZheng commented 1 year ago

Fire Module

image Fire Module with hyperparameters: s1x1 = 3, e1x1 = 4, and e3x3 = 4.

NorbertZheng commented 1 year ago

SqueezeNet Architecture

image SqueezeNet (Left), SqueezeNet with simple bypass (Middle), SqueezeNet with complex bypass (Right).

image Details of SqueezeNet Architecture.

NorbertZheng commented 1 year ago

Evaluation of SqueezeNet

Comparing SqueezeNet to model compression approaches

image Comparing SqueezeNet to model compression approaches.

NorbertZheng commented 1 year ago

Hyperparameters

image Different Hyperparameter Values for SqueezeNet.

NorbertZheng commented 1 year ago

SqueezeNet Variants

image SqueezeNet accuracy and model size using different macroarchitecture configurations.

NorbertZheng commented 1 year ago

Reference