Text-based-traffic-understanding

The dataset and code of the KDD23 paper 'A Study of Situational Reasoning for Traffic Understanding' See the full paper here

Alt text

Datasets description

Introduction

This repository contains four datasets: tv.jsonl, bdd.jsonl, hdt.jsonl and manuals.jsonl. All datasets are structured in a JSON Lines format, where each line is a separate JSON object corresponding to a single data point or example.

Dataset Structures

Complex-TV-QA Dataset

The Complex-TV-QA dataset, to our knowledge, is the inaugural resource that provides human-annotated, detailed video captions within traffic scenarios, alongside complex reasoning questions. This novel dataset not only stands as a vital tool for evaluating language models in real-world video-QA and video-reasoning research, but also offers valuable insights for the development and understanding of multi-modal video reasoning models and related works.

The tv.jsonl dataset has the following fields:

description: A detailed description of a traffic video
question: A question that is related to the video
candidates: A list of text strings representing possible answers to the question
answer: An integer that serves as an index into the list of candidates, corresponding to the correct answer
class: The reasoning types defined in the original Traffic-QA paper
video_file: The original video file from the Traffic-QA dataset, you may request the download from here
id: Indentifier

BDD-QA Dataset

BDD-QA is distinguished by its encompassing range of traffic actions, crafted to rigorously evaluate a model's decision-making abilities in traffcu senario. This makes it a potent tool for high-level decision-making research within traffic contexts, including autonomous driving developments.

The bdd.jsonl dataset has the following fields:

question: A text string which briefly describe a traffic senario and a follow up question
candidates: A list of text strings representing possible answers to the question
answer: An integer that serves as an index into the list of candidates, corresponding to the correct answer
class: The action class that this question is related to, which is defined in the paper
id: Indentifier

HDT-QA Dataset

HDT-QA, coupled with driving manuals, offers an extensive compendium of driving instructions and driving knowledge tests across all 51 states of the US. This resource is beneficial for assessing the incorporation and impact of traffic knowledge within intelligent driving systems, marking a crucial stride towards more advanced, informed, and safe autonomous driving technology.

The hdt.jsonl dataset has the following fields:

question: A text string representing a question, typically related to driving rules or scenarios.
candidates: A list of text strings representing possible answers to the question.
answer: An integer that serves as an index into the list of candidates, corresponding to the correct answer.
state: A text string indicating the state from which the question was sourced.
type: A text string indicating the type of driving test the question pertains to (e.g., permit, motorcycle, CDL).
id: Indentifier

The manuals.jsonl dataset is a large collection of driving manuals from 51 states of the US, which is oringally crawled from DMV TEST PRO, which contains the following keys:

text: A paragraph from the original DMV driving manual
state: The state of the source of this driving manual
domain: The knowledge domain of this driving manual, could be one of: CDL, permit, motorcycle

Evaluation Scripts

In the code folder we have the evaluation code for the experiments in the paper.

Dependencies

Firstly, clone this repo to you local folder:

git clone https://github.com/saccharomycetes/text-based-traffic-understanding.git
cd text-based-traffic-understanding

Then you can install the required packages by

pip install -r requirements.txt

Please ensure these are installed before running the script.

Running the QA and retrieval-QA evaluation script

The script can be run using the following command:

python qa_retrieve_eval.py 
--data_dir [DATA_DIR]
--output_dir [OUTPUT_DIR]
--num_related [NUM_RELATED]
--corpus_file [CORPUS_FILE]
--model_dir [MODEL_DIR]

Replace the bracketed terms with the appropriate paths or values:

[DATA_DIR]: Directory of the data file that you want to evaluate
[OUTPUT_DIR]: Directory where the results will be saved
[NUM_RELATED]: number of retrieved documents you want to retrieve, set it to 0 if you want to run the QA-only evaluation
[CORPUS_FILE]: Path to the corpus file，which is our manuals.jsonl dataset
[MODEL_DIR]: Directory of the pre-trained model, in our evaluation, we use the Unified-QA-V2

Running the KG model evaluation script

The script can be run using the following command:

python kg_eval.py 
--data_dir [DATA_DIR] 
--output_dir [OUTPUT_DIR] 
--model_dir [MODEL_DIR] 
--eval_batch_size [EVAL_BATCH_SIZE]

Replace the bracketed terms with the appropriate paths or values:

[DATA_DIR]: Directory of the data file that you want to evaluate
[OUTPUT_DIR]: Directory where the results will be saved
[MODEL_DIR]: Directory of the pre-trained model (we haven't uploaded this model for now, will be released soon)
[EVAL_BATCH_SIZE]: Batch size for evaluation

Running the NLI model evaluation script

The script can be run using the following command:

python nli_eval.py 
--data_dir [DATA_DIR] 
--output_dir [OUTPUT_DIR] 
--model_dir [MODEL_DIR] 
--eval_batch_size [EVAL_BATCH_SIZE]

Replace the bracketed terms with the appropriate paths or values:

[DATA_DIR]: Directory of the data file that you want to evaluate
[OUTPUT_DIR]: Directory where the results will be saved
[MODEL_DIR]: Directory of the pre-trained model, for our NLI evaluation, we use the Roberta-large-mnli
[EVAL_BATCH_SIZE]: Batch size for evaluation

Cite

If you find our dataset or code to be useful in your research, please consider citing the following paper:

@article{zhang2023study,
  title={A Study of Situational Reasoning for Traffic Understanding},
  author={Zhang, Jiarui and Ilievski, Filip and Ma, Kaixin and Kollaa, Aravinda and Francis, Jonathan and Oltramari, Alessandro},
  journal={arXiv preprint arXiv:2306.02520},
  year={2023}
}

Contact

jrzhang [AT] isi.edu

saccharomycetes / text-based-traffic-understanding

readme

Text-based-traffic-understanding

Datasets description

Introduction

Dataset Structures

Complex-TV-QA Dataset

BDD-QA Dataset

HDT-QA Dataset

Evaluation Scripts

Dependencies

Running the QA and retrieval-QA evaluation script

Running the KG model evaluation script

Running the NLI model evaluation script

Cite

Contact