Hi, what is the format of the ground truth for the training datasets? Is it similar to the ground truth used for detection or semantic segmentation? Does the location of each word need to be labeled? Can this model be used for semantic segmentation tasks only? Thank you very much!