Open haeun0814 opened 1 month ago
@haeun0814 Thank you for the detailed comment! :) As you commented, n the encoder (p13) , In encoder, the convolutional layers squeezes the feature map size (width and height). in the decoder (p33) , they expand the feature (feature map) size (exactly, the width and height).
Additionally, it would be helpful for testing AE code and check the feature dimension changes, in the link .
p13. Because it is Encoder, I think "What if we expand the input size?" should be "What if we squeeze the input size?".
p.33 Also becuase it is Decoder, I think "Then, how to expand the feature dimension?" should be "Then, how to expand the feature size?"