zhangbin-ai / APL

APL for AVQA task
2 stars 1 forks source link

Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering(AAAI'2024) [arXiv]

Authors: Zhangbin Li, Dan Guo, Jinxing Zhou, Jing Zhang, and Meng Wang

Requirements

   python3.7 +
   pytorch1.7.1
   numpy
   ast

Usage

  1. Cloning this repo

    git clone https://github.com/zhangbin-ai/APL.git
  2. Getting Started

    • Training

      python train.py \
      --batch-size 64 \
      --epochs 30 \
      --lr 1e-4 \
      --gpu 0 \
      --checkpoint APL_dir \
      --checkpoint_file checkpoint01 \
      --save_model_flag True \
    • Testing

      python test.py
  3. Citing Us

    If you find this work useful, please consider citing it.

    @article{li2023object,
     title={Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering},
     author={Li, Zhangbin and Guo, Dan and Zhou, Jinxing and Zhang, Jing and Wang, Meng},
     journal={arXiv preprint arXiv:2312.12816},
     year={2023}
    }
  4. Acknowledgement

    The computation is supported by the HPC Platform of Hefei University of Technology.