cxliu0 / PET

[ICCV 2023] Point-Query Quadtree for Crowd Counting, Localization, and More
MIT License
53 stars 5 forks source link
crowdcounting object-counting

Point-Query Quadtree for Crowd Counting, Localization, and More (ICCV 2023)

This repository includes the official implementation of the paper:

Point-Query Quadtree for Crowd Counting, Localization, and More

International Conference on Computer Vision (ICCV), 2023

Chengxin Liu1, Hao Lu1, Zhiguo Cao1, Tongliang Liu2

1Huazhong University of Science and Technology, China

2The University of Sydney, Australia

[Paper] | [Supplementary]

PET

Highlights

We formulate crowd counting as a decomposable point querying process, where sparse input points could split into four new points when necessary. This formulation exhibits many appealing properties:

Installation

torch
torchvision
numpy
opencv-python
scipy
matplotlib
pip install -r requirements.txt

Data Preparation

PET
├── data
│    ├── ShanghaiTech
├── datasets
├── models
├── ...

Training

Evaluation

sh eval.sh

Pretrained Models

Dataset Model Link Training Log MAE
ShanghaiTech PartA SHA_model.pth SHA_log.txt 49.08
ShanghaiTech PartB SHB_model.pth SHB_log.txt 6.18

Citation

If you find this work helpful for your research, please consider citing:

@InProceedings{liu2023pet,
  title={Point-Query Quadtree for Crowd Counting, Localization, and More},
  author={Liu, Chengxin and Lu, Hao and Cao, Zhiguo and Liu, Tongliang},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2023}
}

Permission

This code is for academic purposes only. Contact: Chengxin Liu (cx_liu@hust.edu.cn)

Acknowledgement

We thank the authors of DETR and P2PNet for open-sourcing their work.