Official PyTorch implementation of the paper: "Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis" in [IEEE] (Accepted by IEEE TGRS 2024)
The overview of the MCI model:
Environment Installation:
Download Dataset:
Extract text files for the descriptions of each image pair in LEVIR-MCI:
python preprocess_data.py
After that, you can find the generated files in ./data/LEVIR_MCI/.
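To confirm that preprocessing produced output, a quick check like the following can help. This is a minimal sketch: the helper name `list_generated_files` is hypothetical and not part of this repository, and it assumes only that the generated files live somewhere under ./data/LEVIR_MCI/. It makes no assumption about the file names themselves.

```python
from pathlib import Path


def list_generated_files(root="./data/LEVIR_MCI/"):
    """Return the relative paths of all files under `root`, sorted by name.

    Hypothetical helper for illustration; not part of the repository.
    """
    root_path = Path(root)
    if not root_path.is_dir():
        # Preprocessing has not been run yet, or the data lives elsewhere.
        return []
    return sorted(
        p.relative_to(root_path).as_posix()
        for p in root_path.rglob("*")
        if p.is_file()
    )


if __name__ == "__main__":
    files = list_generated_files()
    if not files:
        print("No files found; run preprocess_data.py first.")
    for name in files:
        print(name)
```

An empty result usually means preprocess_data.py has not been run, or that it wrote to a different folder.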
Make sure you performed the data preparation above. Then, start training as follows:
python train.py --train_goal 2 --data_folder /DATA_PATH_ROOT/Levir-MCI-dataset/images --savepath ./models_ckpt/
Evaluate a trained checkpoint as follows:
python test.py --data_folder /DATA_PATH_ROOT/Levir-MCI-dataset/images --checkpoint {checkpoint_PATH}
We recommend training the model 5 times to get an average score.
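One possible way to aggregate the scores from repeated runs is sketched below. The function name `summarize_runs` and the example values are placeholders for illustration only; they are not results from the paper, and the repository may report scores differently.

```python
from statistics import mean, stdev


def summarize_runs(scores):
    """Return (mean, sample standard deviation) for a list of per-run scores."""
    if not scores:
        raise ValueError("need at least one score")
    # The sample standard deviation is undefined for a single run; report 0.0.
    spread = stdev(scores) if len(scores) > 1 else 0.0
    return mean(scores), spread


if __name__ == "__main__":
    # Placeholder values for illustration only; replace them with the metric
    # reported by test.py for each of your training runs.
    example_scores = [0.50, 0.52, 0.51, 0.49, 0.53]
    avg, spread = summarize_runs(example_scores)
    print(f"mean={avg:.4f} std={spread:.4f} over {len(example_scores)} runs")
```

Reporting the spread alongside the mean makes it easier to judge whether a difference between two configurations is larger than the run-to-run variation.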
Run inference to get started as follows:
python predict.py --imgA_path {imgA_path} --imgB_path {imgB_path} --mask_save_path ./CDmask.png
To use your own model, modify the --checkpoint argument in Change_Perception.define_args() in predict.py. Alternatively, you can download our pretrained model MCI_model.pth here: [Hugging face]. After that, put it in ./models_ckpt/.
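Before running inference from a script, it may be convenient to check that the checkpoint is in place and to assemble the command programmatically. The sketch below uses only the flags shown above (--imgA_path, --imgB_path, --mask_save_path). The helper names `checkpoint_ready` and `build_predict_command` are hypothetical, and the check that the two image paths differ is a convenience assumption of this sketch, not a requirement of predict.py.

```python
from pathlib import Path


def checkpoint_ready(ckpt_dir="./models_ckpt/", name="MCI_model.pth"):
    """Return True if the checkpoint file exists in `ckpt_dir`."""
    return (Path(ckpt_dir) / name).is_file()


def build_predict_command(img_a, img_b, mask_save_path="./CDmask.png"):
    """Build the argument list for predict.py from a pair of image paths."""
    if str(img_a) == str(img_b):
        # Assumption of this sketch: change detection compares two different
        # images, so identical paths are treated as a likely mistake.
        raise ValueError("imgA and imgB should be two different images")
    return [
        "python", "predict.py",
        "--imgA_path", str(img_a),
        "--imgB_path", str(img_b),
        "--mask_save_path", str(mask_save_path),
    ]


if __name__ == "__main__":
    if not checkpoint_ready():
        print("MCI_model.pth not found in ./models_ckpt/")
    # The image paths here are placeholders.
    print(" ".join(build_predict_command("A.png", "B.png")))
```

The returned list can be passed to `subprocess.run` from the folder that contains predict.py.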
cd ./Change-Agent/lagent-main
pip install -e .[all]
Run Agent:
cd into the Multi_change folder:
cd ./Change-Agent/Multi_change
(1) Run Agent CLI Demo:
# You need to install streamlit first
# pip install streamlit
python try_chat.py
(2) Run Agent Web Demo:
# You need to install streamlit first
# pip install streamlit
streamlit run react_web_demo.py
If you find this paper useful in your research, please consider citing:
@ARTICLE{Liu_Change_Agent,
author={Liu, Chenyang and Chen, Keyan and Zhang, Haotian and Qi, Zipeng and Zou, Zhengxia and Shi, Zhenwei},
journal={IEEE Transactions on Geoscience and Remote Sensing},
title={Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis},
year={2024},
volume={},
number={},
pages={1-1},
keywords={Remote sensing;Feature extraction;Semantics;Transformers;Roads;Earth;Task analysis;Interactive Change-Agent;change captioning;change detection;multi-task learning;large language model},
doi={10.1109/TGRS.2024.3425815}}
Thanks to the following repository:
This repo is distributed under the MIT License. The code can be used for academic purposes only.
If you have any other questions❓, please feel free to contact us 👬