Rethinking Spatial Invariance of Convolutional Networks for Object Counting

Highlights

🏆 2022 Pulitzer Prize for Public Service: Application in analyzing the Capitol Riot earned a Pulitzer Prize for public service.
🎬 Watch the YouTube Demonstration.
📰 Read about it on the Washington Post.

In this project, we explore a unique approach to convolutional networks and object counting. This repository contains the key implementation of our paper, Rethinking Spatial Invariance of Convolutional Networks for Object Counting. Here, we introduce the GauNet implementation in C++ and CUDA, supplemented by a TensorFlow plugin. Dive into our innovative method of using locally connected Gaussian kernels to elevate the process of object counting in convolutional networks. Developed with support from Vitjan Zavrtanik (VitjanZ). Developed with support from Vitjan Zavrtanik (VitjanZ). Many implementations of this version are from C^3 Framework and DAU-ConvNet.

Quick Start

TensorFlow Plugin

Easily replace the tf.contrib.layers.conv2d function using our TensorFlow plugin and Python wrappers.

Dependencies:

Python2.7 or Python3.5
TensorFlow 1.6+
Numpy
OpenBlas

Optional:

Scipy
Matplotlib
Python-tk for unit tests

# Install the required dependencies
sudo apt-get install python2.7 python3.5 python-pip python3-pip
pip install tensorflow==1.6 numpy openblas

Installation

For a quick setup, use the pre-compiled binaries for TensorFlow:

# Install further dependencies
sudo apt-get install libopenblas-dev wget

# Set up the package
export TF_VERSION=1.13.1
sudo pip install link_to_dau_conv_package

For Docker users, access our pre-built images on Docker Hub. To manually build and install, please refer to the detailed steps in the original DAU-ConvNet.

Training & Testing

Training:

Set the parameters in config.py and ./datasets/XXX/setting.py. If you wish to reproduce our results, we recommend using our parameters in ./results_reports.
Run the command: python train.py.
Monitor the training with TensorBoard using: tensorboard --logdir=exp --port=6006.

Testing:

We only provide an example to test the model on the test set. You may need to modify it to test your own models.

Note: The training and testing instructions align with the ones used in the C^3 Framework. You can follow similar commands as specified in that framework.

Acknowledgments

Our heartfelt appreciation goes to:

Vitjan Zavrtanik (VitjanZ) for his pivotal support in developing the TensorFlow version of our project.
The teams behind C^3 Framework and DAU-ConvNet. Their work has significantly influenced our TensorFlow implementation.

Citing Our Work

If you use our work in your research, kindly cite our paper:

@InProceedings{Cheng_2022_CVPR,
    author    = {Cheng, Zhi-Qi and Dai, Qi and Li, Hong and Song, Jingkuan and Wu, Xiao and Hauptmann, Alexander G.},
    title     = {Rethinking Spatial Invariance of Convolutional Networks for Object Counting},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
    month     = {June},
    year      = {2022},
    pages     = {19638-19648}
}

License

GauNet is licensed under the Apache 2.0 license.

Disclaimer: Due to the presence of sensitive information, modifications were made to this repository. Unfortunately, the PyTorch version is not available at this time. Thank you for your understanding.

zhiqic / Rethinking-Counting

readme