like413 / VisTA

[arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
https://like413.github.io/CDQAG/
Other
1 stars 0 forks source link

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection

🏠 [Project page]πŸ“„ [arXiv]πŸ’Ύ [Dataset Download]

This repository is the official implementation:

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Ke Li, Fuyu Dong, Di Wang, Shaofeng Li, Quan Wang, Xinbo Gao, Tat-Seng Chua

Abstract

Remote sensing change detection aims to perceive changes occurring on the Earth’s surface from remote sensing data in different periods, and feed these changes back to humans. However, most existing methods only focus on detecting change regions, lacking the ability to interact with users to identify changes that the users expect. In this paper, we introduce a new task named Change Detection Question Answering and Grounding (CDQAG), which extends the traditional change detection task by providing interpretable textual answers and intuitive visual evidence. To this end, we construct the first CDQAG benchmark dataset, termed QAG-360K, comprising over 360K triplets of questions, textual answers, and corresponding high-quality visual masks. It encompasses 10 essential land-cover categories and 8 comprehensive question types, which provides a large-scale and diverse dataset for remote sensing applications. Based on this, we present VisTA, a simple yet effective baseline method that unifies the tasks of question answering and grounding by delivering both visual and textual answers. Our method achieves state-of-the-art results on both the classic CDVQA and the proposed CDQAG datasets. Extensive qualitative and quantitative experimental results provide useful insights for the development of better CDQAG models, and we hope that our work can inspire further research in this important yet underexplored direction.


## πŸ”₯ Benchmark dataset QAG-360K

## 🌟 Simple Baseline Model VisTA

## 🌈 Results QAG-360K

CDVQA

## πŸ™ Acknowledgement The dataset is based on [HiUCD](https://github.com/Daisy-7/Hi-UCD-S), [SECOND](https://captain-whu.github.io/SCD/), [LEVIR-CD](https://chenhao.in/LEVIR/), and [CDVQA](https://github.com/YZHJessica/CDVQA). The code is based on [CRIS](https://github.com/DerrickWang005/CRIS.pytorch). We thank the authors for their open-sourced datasets and codes and encourage users to cite their works when applicable. ## πŸš€ Citation If you use our data or code in your research or find it is helpful, please cite this project. ```bibtex @misc{li2024changedquestionansweringgrounding, title={Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection}, author={Ke Li and Fuyu Dong and Di Wang and Shaofeng Li and Quan Wang and Xinbo Gao and Tat-Seng Chua}, year={2024}, eprint={2410.23828}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2410.23828}, } ``` ## License Licensed under a [Creative Commons Attribution-NonCommercial 4.0 International](https://creativecommons.org/licenses/by-nc/4.0/) for Non-commercial use only. Any commercial use should get formal permission first.