PiLab-CAU / ImageProcessing-2402

Image processing repo
MIT License
0 stars 1 forks source link

[Lecture2-2][1009] Convolutional Autoencoder vs. (Deep) CNN #11

Open mosouka opened 13 hours ago

mosouka commented 13 hours ago

In the lecture, we discussed the use of convolutional layers in autoencoders. I previously studied deep convolutional neural networks (CNNs), and I'm trying to understand the distinctions between the two. Based on my research, I noted that:

  1. CNNs typically have more layers (at least five), while autoencoders can function with fewer layers, even one, although deeper architectures tend to perform better.
  2. Both are feedforward networks, but autoencoders may also utilize recirculation during training.
  3. Autoencoders have a broader range of applications beyond image processing, while CNNs are generally used for image-related tasks.

Could you clarify these differences or provide more insights into how autoencoders and CNNs differ in their structure and applications?

Thank you in advance!

Cosima Balzer

jleem99 commented 11 hours ago

I think you can simply think of it this way:

While the encoder and decoder in autoencoders are often built with fully connected layers, they can also be formed with convolutional layers like in the "convolutional" autoencoder, that is, CNN being part of autoencoder architecture.

yjyoo3312 commented 5 hours ago

@mosouka @jleem99 Thanks for the question and the comment:)

jleem's answers are correct. To add to that:

Thus, CNNs and Autoencoders are not inherently in the same category. While we can design an Autoencoder using a CNN architecture, convolutional layers are widely used across various applications in computer vision.