vdumoulin closed 6 years ago
A take on understanding FiLM-ed networks
still reads as somewhat fragmented
"are likely to share a good amount of computation when mapping from the abstract noise vector to the output image." Consider rephrasing "good amount".
"In the interest of not being confusing to readers already familiar with these methods, we chose to stick to the nomenclature used in the original papers, but we do draw connections to the FiLM nomenclature where appropriate." Consider removing "being".
"In the visual domain, the ImageNet 2017 winning model [20] employs a self-conditioning scheme in the form of feature-wise sigmoidal gating as a way to condition a layer’s activations on its previous layer." I'm still not completely sure that this description of squeeze-excitation is accurate - did you revisit?
"Going backwards from the definition of each computation mechanism, we will now explain how they can be expressed in terms of generalized bilinear transformations." Might be better as "Starting with the mathematical definition of each computation, we will now explain how they can be expressed in terms of generalized bilinear transformations."
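On the squeeze-excitation description: as far as I understand the method, the gates are computed from a global average pool of the same feature map they then rescale, so "self-conditioning" fits but "on its previous layer" may mislead. A minimal sketch for reference, in plain Python. The names `se_gate`, `w1`, and `w2` are hypothetical, biases are omitted, and the toy weights are made up for illustration rather than taken from the paper:

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def se_gate(features, w1, w2):
    """Squeeze-and-excitation-style gating (sketch, biases omitted).

    features: list of C channels, each a flat list of spatial activations.
    w1: R x C bottleneck weights; w2: C x R expansion weights.
    Returns (gates, gated_features).
    """
    # Squeeze: global average pool of each channel of the SAME feature map.
    squeezed = [sum(ch) / len(ch) for ch in features]
    # Excitation: linear -> ReLU -> linear -> sigmoid, giving one gate per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w2]
    # Feature-wise scaling: in FiLM terms, gamma = gates and beta = 0.
    gated = [[g * a for a in ch] for g, ch in zip(gates, features)]
    return gates, gated


# Toy input: 2 channels with 2 spatial positions each (made-up values).
features = [[1.0, 3.0], [0.0, 0.0]]
w1 = [[1.0, 0.0]]    # bottleneck of size 1
w2 = [[0.0], [0.0]]  # zero weights, so every gate is sigmoid(0)
gates, gated = se_gate(features, w1, w2)
```

If that reading is right, the sentence could say the layer's activations are gated by a function of those same activations.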
s/side/contextual