Open MbProg opened 5 years ago
No, the book is correct. The term 'window' here doesn't mean 'filter size'. Chollet wants to say that the information from each 7x7 'zone' of the 28x28 input image is seen only by 3x3 'zone' in the third layer. With usage of MaxPooling layers the size of the 'zone' of the input image from which the third layer 'zone' receives information would be much larger.
On this page it states about not using Max Pooling:
I don't understand where the 7x7 window comes from. The output of the second layer is 24x24. Is that a mistake in the book?