Open late347 opened 3 years ago
Thanks for the warning. There is no bad labels in the annotations in the data. Only the coordinates are written as returned by the face detector.
You can very easily limit the coordinates to the allowed values, as I did in the code: show_save_gt.m in line: 53-56
xmin = max (1, xmin); ymin = max (1, ymin); xmax = min (width, xmax); ymax = min (hight, ymax);
@borutb-fri
Did you remove some of the bounding boxes from widerface? It seems that some of the images have missing bounding boxes.
I use only the subset from the Wider Face dataset. For original bounding boxes of widerface refer to WIDERFACE.
@borutb-fri
Have you ever compared using image classifier and object detector on the same training dataset?
issue 1: These was not necessarily the only bad data I found. But it was what I found from my dataset when I inspected the csv file, after having gotten crash from illegal bounding box outside image dimensions.
File "C:\Users\lauri\anaconda3\envs\tf_kaggle\lib\site-packages\tensorflow\python\ops\control_flow_ops.py", line 154, in Assert raise errors.InvalidArgumentError( tensorflow.python.framework.errors_impl.InvalidArgumentError: Expected 'tf.Tensor(False, shape=(), dtype=bool)' to be true. Summarized data: b'maximum box coordinate value is larger than 1.100000: ' 1.1067415
The ones I found from my subset of your FMLD dataset with sketchy bbox coordinates were these in csv
filename test_00000749.jpg test_00001127.jpg test_00001586.jpg test_00001607.jpg test_00001626.jpg test_00001626.jpg test_00003030.jpg test_00003055.jpg test_00003672.jpg test_00003750.jpg test_00003774.jpg test_00004007.jpg test_00004179.jpg filename train_00000019.jpg
INVALID_COORDINATES_excelsheet_test_train.xlsx
Issue 2: I found Hitler the dataset. I think he was maybe hiding in the WIDERFACE. picture name 0_Parade_Parade_0_914.jpg But I managed to delete him from my dataset as well :)