This repository implements the PrOtotypical nEural Module network. It contains four key components, stored in the following directories:
Please refer to the README in each directory for details.
We adopt the official implementation of the XNM as the backbone model for prototypical reasoning. We use the bottom-up features provided in the following repos: for VQA and for GQA. Please refer to these links for further README information.
If you use our code or data, please cite our paper:
@InProceedings{poem,
author = {Chen, Shi and Zhao, Qi},
title = {Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning},
booktitle = {CVPR},
year = {2023}
}