jiaqixuac / MAP-Net

[CVPR'23] Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior
MIT License
113 stars 10 forks source link
computer-vision dehazing image-restoration low-level-vision video-processing

MAP-Net

PyTorch implementation of MAP-Net, from the following paper:

Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CVPR 2023.\ Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, and Pheng-Ann Heng

We propose MAP-Net, a novel video dehazing framework that effectively explores the physical haze priors and aggregates temporal information.

Dataset

We construct a large-scale outdoor video dehazing benchmark dataset, HazeWorld, which contains video frames in various real-world scenarios.

To prepare the HazeWorld dataset for experiments, please follow the instructions.

Installation

This implementation is based on MMEditing, which is an open-source image and video editing toolbox.

python 3.10.9
pytorch 1.12.1
torchvision 0.13.1
cuda 11.3

Below are quick steps for installation.

Step 1. Install PyTorch following official instructions.

Step 2. Install MMCV with MIM.

pip3 install openmim
mim install mmcv-full

Step 3. Install MAP-Net from source.

git clone https://github.com/jiaqixuac/MAP-Net.git
cd MAP-Net
pip3 install -e .

Please refer to MMEditing Installation for more detailed instruction.

Getting Started

You can train MAP-Net on HazeWorld using the below command with 4 GPUs:

bash tools/dist_train.sh configs/dehazers/mapnet/mapnet_hazeworld.py 4

Evaluation

We mainly use psnr and ssim to measure the model performance. For HazeWorld, we compute the dataset-averaged video-level metrics; see the evaluate function.

You can use the following command with 1 GPU to test your trained model xxx.pth:

bash tools/dist_test.sh configs/dehazers/mapnet/mapnet_hazeworld.py xxx.pth 1

You can find one model checkpoint trained on HazeWorld here.

Results

Demo for the real-world hazy videos.

https://user-images.githubusercontent.com/33066765/224627919-cdc91886-9ab3-4b51-873b-3596c4aea085.mp4

For the REVIDE dataset, the visual results of MAP-Net can be downloaded here.

Acknowledgement

This repository is built using the mmedit and mmseg toolboxes, DAT and STM repositories.

Citation

If you find this repository helpful to your research, please consider citing the following:

@inproceedings{xu2023map,
  title     = {Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior},
  author    = {Jiaqi Xu and Xiaowei Hu and Lei Zhu and Qi Dou and Jifeng Dai and Yu Qiao and Pheng-Ann Heng},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year      = {2023},
}

License

This project is released under the MIT license. Please refer to the acknowledged repositories for their licenses.