gkordo / s2vs

Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]
MIT License
37 stars 2 forks source link
duplicate-videos fivr ndvr self-supervised-learning self-supervision video-detection video-retrieval video-search video-similarity video-similarity-learning video-similarity-search

Self-Supervised Video Similarity Learning

This repository contains the PyTorch implementation of the paper Self-Supervised Video Similarity Learning. It contains code for the training of video similarity learning network with self-supervision. Also, to facilitate the reproduction of the paper's results, the evaluation code, the extracted features for the employed video datasets, and pre-trained models are provided.

Prerequisites

Preparation

Installation

Training

Evaluation

Use our pretrained models

feat_extractor = torch.hub.load('gkordo/s2vs:main', 'resnet50_LiMAC') s2vs_dns = torch.hub.load('gkordo/s2vs:main', 's2vs_dns') s2vs_vcdb = torch.hub.load('gkordo/s2vs:main', 's2vs_vcdb')


## Citation
If you use this code for your research, please consider citing our papers:
```bibtex
@inproceedings{kordopatis2023s2vs,
  title={Self-Supervised Video Similarity Learning},
  author={Kordopatis-Zilos, Giorgos and Tolias, Giorgos and Tzelepis, Christos and Kompatsiaris, Ioannis and Patras, Ioannis and Papadopoulos, Symeon},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year={2023}
}

@inproceedings{kordopatis2019visil,
  title={{ViSiL}: Fine-grained Spatio-Temporal Video Similarity Learning},
  author={Kordopatis-Zilos, Giorgos and Papadopoulos, Symeon and Patras, Ioannis and Kompatsiaris, Ioannis},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2019}
}

Visualization

For visualization examples of augmentation and similarity matrices, as well as model usage in code, have a look at this Colab notebook.

Related Projects

DnS - computational efficiency w/ selector network

ViSiL - original ViSiL approach

FIVR-200K - download our FIVR-200K dataset

License

This project is licensed under the MIT License - see the LICENSE file for details

Contact for further details

Giorgos Kordopatis-Zilos (kordogeo@fel.cvut.cz)