sutdcv / SUTD-TrafficQA

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
https://sutdcv.github.io/SUTD-TrafficQA/
51 stars 2 forks source link
annotations cvpr cvpr2021 dataset multimodal multimodal-deep-learning paper traffic-events video-qa video-reasoning vqa vqa-dataset

SUTD-TrafficQA

A challenging Video Question Answering (VQA) Benchmark based on real-world traffic scenes.

Updates:

Paper

Our paper at CVPR 2021, SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events, is available at: [CVF Open Access], [arXiv:2103.15538], and [ResearchGate].

Dataset

Citation

@InProceedings{Xu_2021_CVPR,
    author    = {Xu, Li and Huang, He and Liu, Jun},
    title     = {{SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning Over Traffic Events}},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {9878-9888}
}

Acknowledgment

Contributors: Lin Yutian, Tran Nguyen Bao Long, Liu Renhang, Qiao Yingjie, Xun Long Ng, Koh Kai Ting, Christabel Dorothy

Code Reference: thaolmk54 / hcrn-videoqa

Contact