Lidar-Camera Semi-Supervised Learning for Semantic Segmentation - Data splits

Public release of our data splits used for the results in the paper:

@ARTICLE{caltagirone2021dataset,
    author = {Caltagirone, Luca and Bellone, Mauro and Lennart, Svenson and Mattias, Wahde and Raivo, Sell},
     title = {Lidar-Camera Semi-Supervised Learning for Semantic Segmentation},
   journal = {MDPI Sensors},
    volume = {21},
    number = {14},
      year = {2021},
       url = {https://www.mdpi.com/1424-8220/21/14/4813},
       doi = {10.3390/s21144813},
     pages = {000--000}
}

Dataset splits

In our paper, we used the Waymo dataset which consists of 1110 driving sequences collected under various weather and lighting conditions. The Waymo open dataset includes 1110 driving sequences recorded with multiple cameras and lidars across a large variety of locations, road types and weather and lighting conditions. Each driving sequence consists of a 20-s-long recording sampled at 10 Hz. Both 2D and 3D bounding boxes were manually generated for all frames and considering the following four categories of objects: vehicles; pedestrians; cyclists; and traffic signs. Additionally, we have partitioned the driving sequences into four broad subsets, namely day–fair; night–fair; day–rain; night–rain (see Table below for further details).

Day–Fair	Day–Rain	Night–Fair	Night–Rain
747	226	82	45

table: Number of 20-s-long driving sequences belonging to the four main categories considered in this work (10 Hz sample frequency).

The labels day and night indicate whether a sequence was collected during the day under good lighting conditions, or late in the day or at night under poor external illumination. The labels fair and rain instead refer to the weather conditions, with fair denoting good weather, and rain denoting active raining or wet environment following recent precipitation. Each sequence originally contains approximately 200 frames (i.e., 20 s sampled at 10 Hz) of which we kept every 10th frame. Out of the full dataset, for~the experiments described in the following sections, we generated N=10 random dataset splits. The randomization was carried out by considering driving sequences instead of individual frames in order to avoid any overlap between the subsets in any given set.

With the exception of the training sets, which only contain examples belonging to the coarse category day--fair, all other sets include examples belonging to all coarse categories. By only considering training data belonging to one category, it is possible to investigate the effectiveness of the proposed approach for carrying out domain adaptation. (See Table below for more details regarding the dataset splits S.)

Set	Day–Fair	Day–Rain	Night–Fair	Night–Rain
Training	100	0	0	0
Validation	10	10	10	5
Unlabelled	40	40	36	20
Test	40	40	36	20

table: Number of sequences assigned to training, validation, unlabelled, and test subsets for each dataset split Si according to the four main categories considered in this work.

The specific lists are provided in the folder ./data/splits/. as text files

./data/splits/

The list files contain rows in the following format:

labeled/day/not_rain/camera/segment-575209926587730008_3880_000_3900_000_with_camera_labels_0000000124.png

where "day", "not_rain" etc. refer to the specific split in the table above, while "segment-XXXXX-.png" refers to the specific image from the waymo dataset. File names follow the same standard and the same format.

bellonemauro / LCSSLSS-DataSplits

readme

Lidar-Camera Semi-Supervised Learning for Semantic Segmentation - Data splits

Dataset splits