How to obtain the reconstructed image for inference and masked

wgcban / adamae

[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders

https://www.wgcban.com/research/adamae

MIT License

72 stars 8 forks source link

How to obtain the reconstructed image for inference and masked #3

Closed hzxie99 closed 10 months ago

hzxie99 commented 10 months ago

Hello, I very much agree with your work. I would like to know how to obtain the schematic diagram of the reconstructed image and the mask image during the experiment. Because I'm just getting started. I really appreciate it if you can help me with this question.

wgcban commented 10 months ago

@hzxie99 Thanks for your interest!.

I wrote a python notebook (which I cannot locate now, but similar to one used in image MAEs https://colab.research.google.com/github/facebookresearch/mae/blob/main/demo/mae_visualize.ipynb) which

first saves the predicted image and mask into numpy array
then visualize each frame in the video by re-arranging and re-normalizing and then saving it as png file

I will try to find my script and post it here. But, I would say it is a modification on top of the above python notebook file.

hzxie99 commented 10 months ago

Thank you very much. The pictures in the paper are very beautiful. Thank you!!

hzxie99 commented 10 months ago

@hzxie99 Thanks for your interest!.

I wrote a python notebook (which I cannot locate now, but similar to one used in image MAEs https://colab.research.google.com/github/facebookresearch/mae/blob/main/demo/mae_visualize.ipynb) which

first saves the predicted image and mask into numpy array

then visualize each frame in the video by re-arranging and re-normalizing and then saving it as png file

I will try to find my script and post it here. But, I would say it is a modification on top of the above python notebook file.

I'm sorry to bother you, but I want to ask if you have found your notes and could you share an example? My ability to draw 3d mask reconstruction images is limited