helblazer811 / ManimML

ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.
MIT License
2.35k stars, 140 forks

2D Max Pooling Layer #16

Closed: helblazer811 closed this issue 1 year ago

helblazer811 commented 1 year ago

The max pooling visualization can work as follows:

  1. Draw boxes around the max pooling regions in the input feature maps.
  2. In each box, highlight one randomly chosen square to stand in for the maximum value.
  3. Create an invisible set of output feature maps with reduced width and height.
  4. Move each square to its corresponding location in the output feature map with an animation.
  5. Make the output feature map appear.
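
To make the bookkeeping behind these steps concrete, here is a minimal pure-Python sketch. It is not ManimML code: `pooling_plan` and its dictionary keys are hypothetical names, and it assumes square, non-overlapping pooling regions (stride equal to kernel size). For each box it picks one random cell (step 2) and records the output cell it should be moved to (step 4).

```python
import random


def pooling_plan(height, width, kernel, seed=None):
    """Hypothetical helper: one entry per kernel x kernel pooling box."""
    rng = random.Random(seed)
    plan = []
    for out_row in range(height // kernel):
        for out_col in range(width // kernel):
            top, left = out_row * kernel, out_col * kernel
            # Step 2: a randomly chosen cell stands in for the maximum.
            picked = (top + rng.randrange(kernel), left + rng.randrange(kernel))
            plan.append({
                "box": (top, left, kernel, kernel),  # step 1: region to outline
                "picked": picked,                    # step 2: cell to highlight
                "target": (out_row, out_col),        # step 4: destination cell
            })
    return plan


# A 4x4 feature map with 2x2 pooling yields a 2x2 output map.
plan = pooling_plan(4, 4, 2, seed=0)
```

An animation layer could then iterate over `plan`, drawing each `box`, highlighting `picked`, and moving it to `target` in the reduced output map.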

If this is too busy, an alternative is:

  1. Draw boxes around the max pooling regions in the input feature maps.
  2. Create an invisible set of output feature maps with reduced width and height.
  3. Resize and translate each square to the appropriate location in the output feature maps.
  4. Make the output feature map appear.
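
The resize-and-translate step can be sketched as plain geometry. This is an illustrative assumption, not ManimML code: `resize_and_translate`, `cell`, and `output_origin` are hypothetical names, the input map's top-left corner is taken to be at (0, 0) with x pointing right and y pointing down, and output cells are assumed to be the same size as input cells.

```python
def resize_and_translate(height, width, kernel, cell=1.0, output_origin=(0.0, 0.0)):
    """Hypothetical helper: scale factor and shift for each pooling box."""
    moves = []
    for out_row in range(height // kernel):
        for out_col in range(width // kernel):
            # Centre of the pooling box in the input map.
            box_cx = (out_col * kernel + kernel / 2) * cell
            box_cy = (out_row * kernel + kernel / 2) * cell
            # Centre of the matching cell in the output map.
            out_cx = output_origin[0] + (out_col + 0.5) * cell
            out_cy = output_origin[1] + (out_row + 0.5) * cell
            moves.append({
                "scale": 1 / kernel,  # box shrinks to one output cell
                "shift": (out_cx - box_cx, out_cy - box_cy),
            })
    return moves


# 4x4 input, 2x2 pooling, output map drawn 6 units to the right of the input.
moves = resize_and_translate(4, 4, 2, output_origin=(6.0, 0.0))
```

Each entry gives the uniform scale and the translation that a box would need in order to land on its output cell; an animation could apply both at once.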

helblazer811 commented 1 year ago

This is done as of https://github.com/helblazer811/ManimML/commit/46958ea293e9d54cd0057fb6b0cb6458c331dabe